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Identification and Expression of Insect Steroid 
Receptor MIA Sequences 

This invention was made in part with U.S. government 
5 support under Grant DCB 8405370 from the National Science 

Foundation. The U.S. government may have certain rights 
in this invention. 

FIELD OF THE INVENTION 
10 This invention relates generally to the use of 

recombinant DNA methods as applied to the nucleic acid 
sequences and polypeptides characteristic of insect 
steroid receptor superfamily members and, more 
particularly, to uses of such receptors and the DNA 
15 regulatory elements associated with genes whose 

expression they regulate for the production of proteins 
in cultured cells, and to uses of such hormone receptor 
proteins and genes in identifying new hormones that 
control insect development. 

20 

BACKGROUND OF THE INVENTION 
The temporal sequence of gene expression determines 
the nature and sequence of steps in the development of 
the adult animal from the fertilized egg. The common 

25 fruit fly, Drosophila melanoqaster , provides a favorable 

model system for studying this genetic control of 
development. Various aspects of Drosophila development 
are representative of general insect and, in many 
respects, vertebrate development. 

3 0 The steroid hormone 20-OH ecdysone, also known as 

S-ecdysone, controls timing of development in many 
insects. See, generally , Koolman (ed.), Ecdysone: From 
Chemistry to Mode of Action , Thieme Medical Pub., N.Y. 
(1989) , which is hereby incorporated herein by reference. 

35 The generic term "ecdysone" is frequently used as an 

abbreviation for 20-OH ecdysone. Pulses, or rises and 
falls, of the ecdysone concentration over a short period 
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of time in insect development are observed at various 
stages of Drosophila development. 

These stages include embryogenesis, three larval 
stages and two pupal stages. The last pupal stage ends 
with the formation of the adult fly. An ecdysone pulse 
at the end of the third, or last, larval stage pulse 
triggers the beginning of the metamorphosis of the larva 
to the adult fly. Certain tissues, called imaginal 
tissues, are induced to begin their formation of adult 
structures such as eyes, wings, and legs. 

During the larval stages of development, giant 
polytene chromosomes develop in non-imaginal larval 
tissues. These cable-like chromosomes consist of 
aggregates comprising up to about 2,000 chromosomal 
copies. These chromosome aggregates are extremely useful 
because they provide a means whereby the position of a 
given gene within a chromosome can be determined to a 
very high degree of resolution, several orders of 
magnitude higher than is typically possible for normal 
chromosomes . 

A "puff" in the polytene chromosomes is a localized 
expansion or swelling of these cable-like polytene 
chromosome aggregates that is associated with the 
transcription of a gene at the puff locus. A puff is, 
therefore, an indicator of the transcription of a gene 
located at a particular position in the chromosome. 

A genetic regulatory model was proposed to explain 
the temporal sequence of polytene puffs induced by the 
ecdysone pulse which triggers the larval-to-adult 
metamorphosis. See, Ashburner et al., "On the Temporal 
Control of Puffing Activity in Polytene Chromosomes," 
Cold Staring Harbor Svmp. Q uant. Biol. 38:655-662 (1974). 
This model proposed that ecdysone interacts reversibly 
with a receptor protein, the ecdysone receptor, to form 
an ecdysone-receptor complex. This complex would 
directly induce the transcription of a small set of 
"early" genes responsible for a half dozen immediately 
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induced "early" puffs. These early genes are postulated 
to encode regulatory proteins that induce the 
transcription of a second set of "late" genes responsible 
for the formation of the "late" puffs. The model thus 
5 defines a genetic regulatory hierarchy of three ranks, 

the ecdysone-receptor gene in the first rank, the early 
genes in the second rank, and the late genes in the 
third. While this model was derived from the puffing 
pattern observed in a non-imaginal tissue, similar 
10 genetic regulatory hierarchies may also determine the 

metamorphic changes in development of imaginal tissues 
that are also targets of ecdysone, as well as the changes 
in tissue development induced by the pulses of ecdysone 
that occur at other developmental stages. 
15 Various structural data have been derived from 

receptors for vertebrate steroids and other lipophilic 
receptor proteins. A "super family" of such receptors has 
been defined on the basis of their structural 
similarities. See , Evans, "The Steroid and Thyroid 
20 Hormone Receptor Superf amily , " Science 240:889-895 

(1988) ; Green and Chambon, "Nuclear Receptors Enhance Our 
Understanding of Transcription Regulation," Trends in 
Genetics 4:309-314 (1988), both of which are hereby 
incorporated herein by reference. Where their functions 
25 have been defined, these receptors, complexed with their 

respective hormones, regulate the transcription of their 
primary target genes, as proposed for the ecdysone 
receptor in the above model. 

Cultivated agriculture has greatly increased 
30 efficiency of food production in the world. However, 

various insect pests exploit cultivated sources of food 
to their own advantage. These insect pests typically 
develop by a temporal sequence of events characteristic 
of their order. Many, including Drosophila . initially 
3 5 develop in a caterpillar or maggot-like larval form. 

Thereafter, they undergo a metamorphosis from which 
emerges an adult having characteristic anatomical 
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features. Anatomic similarity is a reflection of 
developmental, physiological and biochemical similarities 
shared by these insects. In particular, the principles 
governing the role of insect ecdysteroid-hormone 
receptors in Drosophila development, as described above, 
likely are shared by many different types of insects. 

As one weapon against the destruction of cultivated 
crops by insects, organic molecules with pesticidal 
properties are used commonly in attempts to eliminate 
insect populations. However, the ecological side effects 
of these pesticides, due in part to their broad activity 
and lack of specificity, and also in part to the fact 
that some of these pesticides are not easily 
biodegradable, significantly affect populations of both 
insects and other species of animals. Some of these 
populations may be advantageous from an ecological or 
other perspective. Furthermore, as the insect 
populations evolve to minimize the effects of the applied 
pesticides, greater amounts of pesticides must be 
applied, causing significant direct and indirect effects 
on other animals, including humans. Thus, an important 
need exists for both highly specific and highly active 
pesticides which are biodegradable. Novel insect 
hormones which, like the ecdysteroids, act by complexing 
with insect members of the steroid receptor superfamily 
to control insect development, are likely candidates for 
pesticides with these desirable properties. 

The use of insect hormones may also have other 
important applications. Many medically and commercially 
important proteins can be produced in a usable form by 
genetically engineered bacteria. However, many expressed 
proteins are processed incorrectly in bacteria and are 
preferably produced by genetically engineered eucaryotic 
cells. Typically, yeast cells or mammalian tissue- 
culture cells are used. Because it has been observed 
that protein processing of foreign proteins in yeast 
cells is also frequently inappropriate, mammalian 
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cultured cells have become the central focus for the 
production of many proteins. It is commonly known that 
the production of large amounts of foreign proteins makes 
these cells unhealthy, which may affect adversely the 
yield of the desired protein. This problem may be 
circumvented, in part, by using an inducible expression 
system. In such a system, the cells are engineered so 
that they do not express the foreign protein until an 
inducing agent is added to the growth medium. In this 
way, large quantities of healthy cells can be produced 
and then induced to produce large amounts of the foreign 
protein. Unfortunately, in the presently available 
systems, the inducing agents themselves, such as metal 
ions or high temperature, adversely affect the cells, 
thus again lowering the yield of the desired foreign 
protein the cells produce. A need therefore exists for 
the development of benign inducing factors for efficient 
production of recombinant proteins. Such factors could 
also prove invaluable for the therapy of human patients 
suffering from inability to produce particular proteins, 
treatment with these factors controlling both the timing 
and the abundance of the protein produced in the affected 
individual . 

The hormones that complex with mammalian or other 
vertebrate members of the steroid receptor superfamily 
are unlikely candidates as such benign factors because 
they would alter the expression of many target genes in 
cells bearing these receptors, thereby adversely 
affecting the host cells. 

For these and other reasons, obtaining steroid 
receptors or nucleic acid information about them has been 
a goal of researchers for several years. Unfortunately, 
efforts have been unsuccessful despite a significant 
investment of resources. The absence of information on 
the structure and molecular biology of steroid receptors 
has significantly hindered the ability to produce such 
products . 
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Thus, there exists a need for detailed sequence 
information on insect members of the steroid receptor 
super family, and the genes that encode these receptors 
and for resulting reagents. Reagents are provided which 
are useful in finding new molecules which may act as 
agonists or antagonists of natural insect members of the 
steroid receptor superfamily, or as components of systems 
for highly specific regulation of recombinant proteins in 
mammalian cells. 

SUMMARY OF THE INVENTION 

In accordance with the present invention, isolated 
recombinant nucleic acids are provided which, upon 
expression, are capable of coding for other than a native 
vertebrate steroid receptor or fragment thereof. 
These nucleic acids typically comprise a segment having a 
sequence substantially homologous to one or more coding 
regions of domains A, B, D, E, or F from an insect 
steroid receptor superfamily member gene having steroid 
binding domain homology, e.g., EcR, DHR3 , E75A, or E75B. 
Preferably, the .nucleic acids encode a polypeptide 
capable of binding to a ligand for an insect steroid 
receptor superfamily member and are capable of 
hybridizing to an insect steroid receptor superfamily 
member gene segment under selective hybridization 
conditions, usually stringent hybridization conditions. 
Mammalian cells transformed with the nucleic acids are 
also provided. 

In another embodiment, isolated recombinant nucleic 
acids are included that have sequence exhibiting identity 
over about 20 nucleotides of a coding segment of an 
insect steroid receptor superfamily member having steroid 
binding domain homology. The nucleic acids can be 
transformed into cells to express a polypeptide which 
binds to a control element responsive to a ligand of an 
insect steroid receptor superfamily. 
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Alternatively, an isolated DNA molecule is provided 
comprising a DNA sequence capable of binding to an insect 
steroid receptor superfamily member other than 20-OH 
ecdysone receptor, such as DHR3, E75A, or E75B. The DNA 
sequence will generally be present in an expression 
vector and promote transcription of an operably linked 
sequence (e.g., encoding a polypeptide) in response to 
binding by an insect steroid receptor superfamily member. 
Cells comprising the nucleic acids are provided, as are 
cells expressing the polypeptides. In certain 
embodiments, non-insect cells will be used, including 
mammalian cells. Also contemplated are recombinant 
nucleic acids comprising a controlling element responsive 
to a ligand, e.g., a ligand which binds to an insect 
steroid receptor superfamily member ligand responsive 
controlling element, a non-heat shock promoter sequence 
(e.g., an alcohol dehydrogenase promoter) and a sequence 
comprising a reporter gene. Usually the controlling 
element will operate to make transcription of the 
reporter gene responsive to the presence of the ligand. 

Additional embodiments of the present invention 
include polypeptides comprising an insect steroid 
receptor superfamily member or fragment thereof, wherein 
such polypeptide is substantially free of naturally- 
associated insect cell components and exhibits a 
biological activity characteristic of an insect steroid 
receptor superfamily member with a hormone binding 
domain. Preferably, the insect steroid receptor 
superfamily member or fragment thereof also comprises a 
DNA binding domain and the polypeptide is capable of 
binding to a hormone analogue selected from the group 
consisting of an insect hormone, an insect hormone 
agonist and an insect hormone antagonist. The 
polypeptide can comprise a zinc-finger domain and usually 
is capable of binding to a DNA controlling element 
responsive to an insect hormone. As desired, the 
polypeptide will be fused to a second polypeptide, 
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typically a heterologous polypeptide which comprises a 
second steroid receptor super family member. Cells, often 
mammalian cells, comprising the protein are provided. 

Fragments of such polypeptides can have a sequence 
substantially homologous to consensus El, E2 or E3 region 
sequences. By way of example, a preferred fragment has a 
sequence comprising: 

a segment at least about 25% homologous to a 
consensus El region sequence; 

a segment at least about 30% homologous to a 
consensus E2 region sequence; and 

a segment at least about 30% homologous to a 
consensus E3 region sequence. 

The polypeptides of the present invention have a 
variety of utilities. For example, a method for 
selecting DNA sequences capable of being specifically 
bound by an insect steroid receptor superfamily member 
can comprise the steps of screening DNA sequences for 
binding to such polypeptides and selecting DNA sequences 
exhibiting such binding. Alternatively, methods for 
selecting ligands, e.g., ecdysteroid analogues, specific 
for binding to a hormone binding domain of an insect 
steroid receptor superfamily member can comprise the 
steps of screening compounds for binding to one or more 
superfamily members and selecting compounds exhibiting 
specific binding to the members. 
Also included are methods for modulating insect 
physiology or development (e.g., killing) comprising the 
steps of screening compounds for binding to an insect 
steroid receptor superfamily member, selecting compounds 
exhibiting said binding and administering the ligand to 
an insect. 

Additionally provided are methods for selecting 
ligands specific for binding to a ligand binding domain 
of an insect steroid receptor superfamily member 
comprising combining: 
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(i) a fusion polypeptide which comprises a 

ligand binding domain functionally 
linked to a DNA binding domain of a 
second steroid receptor superfamily 
member; and 

(ii) a second nucleic acid sequence encoding a 

second polypeptide, wherein 
expression of the second nucleic acid 
sequence is responsive to binding by 
the DNA binding domain; 
screening compounds for an activity of inducing 
expression of the second polypeptide; and 

selecting those compounds which do so. 
This will often be performed in a cell, e.g., with cells 
transformed with DNA encoding a fusion protein. This 
method allows selction of analogues which are useful in 
modulating insect physiology or development. 

Also provided are methods for producing a 
polypeptide comprising the steps of: 

selecting a cell, typically a mammalian or plant 
cell which is substantially insensitive to 
exposure of an insect steroid receptor 
superfamily ligand; 
introducing into said cell: 

(i) a receptor for the ligand; and 

(ii) a nucleic acid sequence encoding the 

polypeptide, the nucleic acid 
sequence operably linked to a 
controlling element responsive 
to presence of the selected 
ligand, wherein a transformed 
cell is produced; and 
exposing the transformed cell to the ligand. 
Usually the cell will be a mammalian cell, and will 
sometimes be introduced into a whole organism, e.g., a 
plant or animal. 
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Kits for the determination of expression levels of 
the nucleic acids and proteins provided herein are made 
available. Typically, the kit will have at least one 
compartment which contains a reagent which specifically 
5 binds to the desired target molecule, e.g., ligand 

analogues, receptors, or nucleic acids. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1. pMTEcR, a Cu 2+ -inducible EcR expression plasmid. 
The Pmt, EcR ORF and Act 5c poly A elements are defined in 
Experimental Example III, part A. The HYG r ORF confers 
hygromycin resistance and is under control of the 
promoter in the LTR of Drosoohila transposable elements, 
cop_ia. The SV40 intron/poly A element provides an intron 
for a possible splicing requirement, as well as a 
polyadenylation/cleavage sequence for the HYG r ORF mRNA. 
The pAT153 DNA derives from a bacterial plasmid. 
Figure 2. The ecdysone-inducible pEcRE/Adh/Bgal reporter 
plasmid. See the text of Experimental Example III, part 
B, for the construction of this plasmid and the 
definitions of all symbols (except the SV40 splice and 
poly A) which are defined in the figure legend. 
Figure 3. The constitutive EcR expression plasmid, 
pActEcR. The construction of this plasmid and the 
definition of the symbols are given in Experimental 
Example III, part B. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
The present invention provides novel isolated 
3 0 nucleic acid sequences encoding polypeptide products 

exhibiting the structure and/or activities of insect 
members of the steroid receptor superfamily. Having 
elucidated the structures of these insect steroid 
receptors from their genes, the separate ligand-binding 
3 5 domains and DNA-binding domains are used individually or 

in combination to screen for new ligands or DNA sequences 
which bind to these domains. Thus, for example, by 



15 



20 
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binding to promotor sequences incorporating a DNA binding 
site, these receptors will usually control expression of 
reporter genes for which sensitive assays exist. Or, the 
hormone-binding domains serve as reagents for screening 
5 for agonists or antagonists of steroid receptor 

superfamily members. Either new classes of molecules, or 
selected modifications of known ligands will be screened 
for receptor binding. New ligands obtained in this way 
find use as highly specific and highly active naturally 

10 occurring pesticides. Alternatively, structural 

information about interactions between the ligand and 
binding domains directs methods for mutagenizing or 
substituting particular residues in the binding domains, 
thereby providing for altered binding specificity. Thus, 

15 inter alia , the present invention provides for screening 

for new ligand molecules, for the design of new ligand- 
binding domain interactions, for producing novel chimeric 
steroid receptor superfamily members and for generating 
new combinations of ligands and binding domains. 

20 The present invention also provides for the 

isolation or identification of new steroid hormone- 
responsive elements and associated genes. By appropriate 
operable linkage of selected sequences to DNA controlling 
elements which are responsive to binding by the DNA- 

25 binding domains of steroid receptor superfamily members, 

new regulatory combinations result. The present 
invention further provides for the design of either a 
binding domain in a member of the insect steroid receptor 
superfamily that will recognize given DNA sequences, or 

3 0 conversely for the modification of DNA sequences which 

will bind to particular receptor DNA-binding domains. 
Both the DNA-binding domain of a superf amily-member 
polypeptide and its DNA recognition sequence can be 
coordinately modified to produce wholly new receptor-DNA 

35 interactions. 

In an alternative embodiment, a DNA-binding sequence 
recognized by a selected receptor will be operably linked 
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to a desired genetic sequence for inducible expression. 
Thus, upon administration of a ligand specific for that 
selected receptor, the genetic sequence is appropriately 
regulated. Expression systems are constructed that are 
responsive to administration of insect steroid receptor 
superfamily-specif ic ligands. By identifying and 
isolating new members of the insect steroid receptor 
superfamily, new, useful regulatory reagents become 
available, both hormones and controlling elements. 

In another embodiment, highly regulatable expression 
of a gene is achieved by use of regulatory elements 
responsive to ligands specific to the superfamily 
members. If transformed cells are grown under conditions 
where expression is repressed or not induced, the cells 
will grow to higher densities and enjoy less stressful 
conditions. Upon reaching high density, the regulatory 
ligand molecule is added to cause high expression. 
Selected cells otherwise insensitive to the inducing 
ligand will not be affected by exposure to the ligand 
used to regulate expression. This provides a means both 
for highly efficient regulatable expression of genes, and 
for introduction of these genes into intact organisms. 

In accordance with specific embodiments of the 
present invention, nucleic acid sequences encoding 
portions of insect steroid hormone receptor superfamily 
members have been elucidated. DNA encoding four 
different members of the Drosophila steroid receptor 
superfamily have been characterized: (1) the 20-OH 
ecdysone receptor, also called the ecdysone receptor 
(EcR) , for which a full-length encoding sequence has been 
determined; (2) Drosophila hormone receptor 3 (DHR3), a 
protein with sequence homology to various steroid 
receptor superfamily members; (3 and 4) E75A and E75B, 
closely related proteins, encoded by segments of the same 
gene, and each possessing sequence homology to other 
steroid receptor superfamily members. 
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The DNA sequences encoding each of these members of 
the insect steroid receptor superfamily provide probes 
for screening for homologous nucleic acid sequences, both 
in Drosophila and other sources. This screening allows 
5 isolation of homologous genes from both vertebrates and 

invertebrates. Production of large amounts of the 
encoded proteins is effected by inserting those sequences 
into expression systems. 

The EcR, DHR3, E75A, and E75B genes are each linked 

10 to similar DNA sequences which likely function as 

controlling, or regulatory elements which are responsive 
to insect steroids. The present invention provides for 
the isolation of these hormone-responsive control 
elements, and for their use in regulating gene 

15 expression. One embodiment of a DNA construct comprises: 

(1) multiple copies of an insect steroid receptor 
superfamily controlling element linked to (2) a minimal 
gene promoter, preferably not a heat shock gene promoter, 
which provides highly inducible expression of (3) an 

20 operably linked gene. This construct provides a very 

sensitive assay for the presence of the controlling 
molecule of the receptor. 

Another aspect of the present invention involves 
cells comprising: (1) isolated recombinant gene segments 

25 encoding biologically active fragments of insect steroid 

receptor superfamily proteins; (2) DNA sequences which 
bind insect steroid receptors, e.g., the elements 
involved in hormone-responsive control; or (3) modified 
receptor proteins. Transformed cells are understood to 

30 include their progeny. In particular, the present 

invention provides for a system whereby expression of 
polypeptides is responsive to steroid induction. For 
instance, a system which expresses a desired protein in 
response to exposure to ecdysone analogues is constructed 

35 by operably linking a promoter having an ecdysone- 

responsive enhancer to a peptide encoding segment. 



WO 91/13167 



PCT/US91/01189 



14 

The present invention also provides insect steroid 
receptor proteins substantially free from naturally- 
associated insect cell components. Such receptors will 
typically be either full-length proteins, functional 
fragments, or fusion proteins comprising segments from an 
insect steroid receptor protein fused to a heterologous, 
or normally non-contiguous, protein domain. 

The present invention further provides a number of 
methods for utilizing the subject receptor proteins. One 
aspect of the present invention is a method for selecting 
new hormone analogues. The isolated hormone-binding 
domains specifically bind hormone ligands, thereby 
providing a means to screen for new molecules possessing 
the property of binding with high affinity to the ligand- 
binding region. Thus, a binding domain of an insect 
steroid receptor superfamily member will be used as a 
reagent to develop a binding assay. On one level, the 
binding domains are useful as affinity reagents for a 
batch or a column selective process, i.e., to selectively 
retain ligands which bind. Alternatively, a functional 
assay is preferred for its greater sensitivity to ligand- 
binding, whether a direct binding assay or an indirect 
assay in which binding is linked to an easily assayed 
function. For example, by operable linkage of an easily 
assayable reporter gene to a controlling element 
responsive to binding by an insect steroid receptor 
superfamily member, in which ligand-binding induces 
protein synthesis, an extremely sensitive assay for the 
presence of a ligand or of a receptor results. Such a 
construct useful for assaying the presence of 20-OH 
ecdysone is described below. This construct is useful 
for screening for agonists or antagonists of the 20-OH 
ecdysone ligand. 

In particular, this method allows selecting for 
ligands which bind to an "orphan" receptor, i.e., a 
receptor whose ligand is unknown. Binding domains for 
"unknown" ligands will often originate from either newly 
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identified insect steroid receptor superfamily members, 
or from mutagenesis. A hybrid receptor will be created 
with a ligand-binding domain and DNA-binding domain from 
different sources. For example, a hybrid receptor 
5 between a putative binding domain and a known DNA-binding 

domain would allow screening for ligands. An "orphan 
receptor" binding domain will be functionally linked to a 
known DNA-binding domain which will control a known 
reporter gene construct whose expression will be easily 
10 detected. This system for ligand-receptor binding 

provides an extremely sensitive assay for ligand-receptor 
interactions . 

Alternatively, the recognition of important features 
of tertiary structure and spatial interactions between a 

15 ligand-binding domain from an insect steroid receptor 

superfamily member and its ligand will allow selection of 
new combinations of ligand-binding domains with ligands. 
Either method provides for selecting unusual ligands 
which specifically bind a modified polypeptide-binding 

20 domain of a receptor. This approach allows selection of 

novel steroid hormone analogues which exhibit modified 
specificity for binding to a subgroup of steroid 
receptors . 

The present invention also provides for new and 
25 useful combinations of the various related components: 

the recombinant nucleic acid sequences encoding the 
polypeptides, the polypeptide sequences, and the DNA 
sites to which the receptors bind (i.e., the regulatory, 
or control, elements). For instance, fusing portions of 
3 0 nucleic acid sequences encoding peptides from different 

sources will provide polypeptides exhibiting hybrid 
properties, e.g., unusual control and expression 
characteristics. In particular, hybrid receptors 
comprising segments from other members of the 
35 superfamily, or from other sources, will be made. 

Combining an insect steroid receptor-responsive enhancer 
segment with a different polypeptide encoding segment 
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will produce a steroid-responsive expression system for 
that polypeptide. 

The isolation of insect steroid receptors provides 
for isolation or screening of new ligands for receptor 
binding. Some of these will interfere with, or disrupt, 
normal insect development. These reagents will allow the 
user to accelerate or decelerate insect development, for 
instance, in preparing sterile adults for release. 
Alternatively, a delay or change in the timing of 
development will often be lethal or will dramatically 
modify the ability of an insect to affect an agricultural 
crop. Thus, naturally occurring biodegradable and highly 
active molecules able to disrupt the timing of insect 
development will result. 

Furthermore, these polypeptides provide the means by 
which have been raised antibodies possessing specificity 
for binding to particular steroid receptor classes. 
Thus, reagents will be produced for determining, 
qualitatively or quantitatively the presence of these or 
homologous polypeptides. Alternatively, these antibodies 
will be used to separate or purify receptor polypeptides. 

Transcription sequences of insect steroid receptor 
superfamilv members 

The ecdysone receptor gene is a member of the 
steroid and thyroid hormone receptor gene superfamily, a 
group of ligand-responsive transcription factors. See . 
Evans (1988) Science 240:889-895; and Segraves, Molecular 
and Genetic Anal ysis of the E75 Ecdvsone-Responsive Gene 
of Drosophila mel anoaaster (Ph.D. thesis, Stanford 
University 1988) , both of which are hereby incorporated 
herein by reference for all purposes. These receptors 
show extensive sequence similarity, especially in their 
"zinc finger" DNA-binding domains, and also in a ligand 
(or hormone or steroid) binding domain. Modulation of 
gene expression apparently occurs in response to binding 
of a receptor to specific control, or regulatory, DNA 
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elements. The cloning of receptor cDNAs provides the 
first opportunity to study the molecular bases of steroid 
action. The steroid receptor superfamily is a class of 
receptors which exhibit similar structural and functional 
features. While the term insect is used herein, it will 
be recognized that the same methods and molecules will be 
derived from other species of animals, in particular, 
those of the class Insecta . or, more broadly, members of 
the phylum Arthropoda which use ecdysteroids as hormones. 
Members of the insect steroid receptor superfamily are 
characterized by functional ligand-binding and DNA 
binding domains, both of which interact to effect a 
change in the regulatory state of a gene operably linked 
to the DNA-binding site of the receptor. Thus, the 
receptors of the insect steroid receptor superfamily seem 
to be ligand-responsive transcription factors. The 
receptors of the present invention exhibit at least a 
hormone-binding domain characterized by sequence homology 
to particular regions, designated El, E2 and E3 . 

The members of the insect steroid receptor 
superfamily are typically characterized by structural 
homology of particular domains, as defined initially in 
the estrogen receptor. Specifically, a DNA-binding 
domain, C, and a ligand-binding domain, E, are separated 
and flanked by additional domains as identified by Krust 
et al. (1986) EMBO J. 5:891-897, which is hereby 
incorporated herein by reference. 

The C domain, or zinc-finger DNA-binding domain, is 
usually hydrophilic, having high cysteine, lysine and 
arginine content — a sequence suitable for the required 
tight DNA binding. The E domain is usually hydrophobic 
and further characterized as regions El, E2 and E3 . The 
ligand-binding domains of the present invention are 
typically characterized by having significant homology in 
sequence and structure to these three regions. Amino 
proximal to the C domain is a region initially defined as 
separate A and B domains. Region D separates the more 
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conserved domains C and E. Region D typically has a 
hydrophilic region whose predicted secondary structure is 
rich in turns and coils. The F region is carboxy 
proximal to the E region ( see , Krust et al., supra). 

The ligand-binding domain of the members of the 
insect steroid receptor superfamily is typically 
carboxyl -proximal , relative to a DNA-binding domain 
described below. See , Evans (1988) Science 240:889-895. 
The entire hormone-binding domain is typically between 
about 200 and 250 amino acids but is potentially shorter. 
This domain has the subregions of high homology, 
designated the El, E2 and E3 regions. See , e.g., 
Table 4. 

The El region is 19 amino acids long with a 
consensus sequence AKX(L/I) PGFXXLT (L/I) (D/E) DQITLL, where 
X represents any amino acid and the other letters are the 
standard single-letter code. Positions in parentheses 
are alternatives. Typically, members of the insect 
steroid receptor superfamily will have at least about 
five matches out of the sixteen assigned positions, 
preferably at least about nine matches, and in more 
preferred embodiments, at least about ten matches. 
Alternatively, these insect steroid receptor superfamily 
members will have homologous sequences exhibiting at 
least about 25% homology, normally at least about 3 0% 
homology, more normally at least about 35% homology, 
generally at least about 40% homology, more generally at 
least about 45% homology, typically at least about 50% 
homology, more typically at least about 55% homology, 
usually at least about 60% homology, more usually at 
least about 70% homology, preferably at least about 80% 
homology, and more preferably at least about 90% homology 
at positions assigned preferred amino acids. 

The E2 region is a 19 amino-acid segment with a 
consensus sequence: 

E(F/Y) (A/V) (L/C) (L/M)KA(I/L) (V/L)L(L/I) 
(N/S) (S/P)D(P/-) (R/K) (P/D)GL , 
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where - represents an optional absence of an amino acid. 
Typically, an insect steroid receptor superfamily member 
will exhibit at least about six matches, preferably at 
least about eight matches and more preferably at least 
about nine matches. Alternatively, E2 sequences of 
insect steroid receptor superfamily members exhibit at 
least about about 25% homology, normally at least about 
30% homology, more normally at least about 35% homology, 
generally at least about 40% homology, more generally at 
least about 45% homology, typically at least about 50% 
homology, more typically at least about 55% homology, 
usually at least about 60% homology, more usually at 
least about 70% homology, preferably at least about 80% 
homology, and more preferably at least about 90% homology 
at positions assigned preferred amino acids. 

The E3 region is a 12 amino-acid segment with a 
consensus sequence 

LXKLLXXLPDLR . 
The insect steroid receptor superfamily members will 
typically show at least about four matches out of the 
nine assigned preferences in the E3 region, preferably at 
least about five matches and more preferably at least 
about six matches. Alternatively, over the assigned 
positions, members of the insect steroid receptor 
superfamily will exhibit at least about 25% homology, 
normally at least about 30% homology, more normally at 
least about 35% homology, generally at least about 40% 
homology, more generally at least about 45% homology, 
typically at least about 50% homology, more typically at 
least about 55% homology, usually at least about 60% 
homology, more usually at least about 70% homology, 
preferably at least about 80% homology, and more 
preferably at least about 90% homology at positions 
assigned preferred amino acids. 

In preferred embodiments, the insect steroid 
receptor superfamily members will exhibit matching of at 
least about five positions in an El region, at least 
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about six positions in an E3 region and at least about 
four positions in an E3 region. The El, E2, and E3 
regions are defined, e.g., in Table 4. 

The DNA-binding domain of these insect steroid 
receptor superfamily members is characterized by a "zinc 
fingers" motif. See , Evans (1988) Science 240:889-895. 
The domain is typically amino proximal to the ligand, or 
hormone, binding site. Typically, the DNA-binding domain 
of the insect steroid receptor superfamily members is 
characterized by clustering of basic residues, a 
cysteine-rich composition and sequence homology. See , 
Evans, R. M. (1988) Science 240:889-89; and Experimental 
section below. Significant polypeptide sequence homology 
among superfamily members exists. The insect steroid 
receptor superfamily members will exhibit at least about 
3 0% homology in the 67+1 amino acid region of this 
domain, normally at least about 40% homology, usually at 
least about 45% homology, and preferably at least about 
55% homology. 

Steroids are derivatives of the saturated 
tetracyclic hydrocarbon perhydrocyclopentanophenanthrene. 
Among the molecules in the group "steroids" are: the bile 
acids, e.g., cholic acid, and deoxycholic acid; the 
adrenocortical steroids, e.g., corticosterone and 
aldosterone; the estrogens, e.g., estrone and B- 
estradiol; the androgens, e.g., testosterone and 
progesterone; and the ecdysteroids . The terms steroid or 
steroid hormones are used interchangeably herein and are 
intended to include all steroid analogues. Typically, 
steroid analogues are molecules which have minor 
modifications of various peripheral chemical groups. 
See . Koolman (ed. ) (1989), cited above, for details on 
ecdysteroids . 

Although ligands for the insect steroid receptor 
superfamily members have historically been characterized 
as steroids, the term "steroid" as in "insect steroid 
receptor superfamily" is not meant only literally. The 
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use of "steroid" has resulted from a historical 
designation of members of a group recognized initially to 
include only molecules having specific defined molecular 
structures. However, this limitation is no longer 
5 applicable since functions are no longer only correlated 

with precise structures. Thus, there will be members of 
the insect steroid receptor super family, as defined 
herein, whose ligand-binding specificities are not 
directed to classically defined "steroids." Typically, 

10 the ligands for members of the insect steroid receptor 

superfamily are lipophilic molecules which are structural 
analogues of steroid molecules. 

The term ligand is meant to refer to the molecules 
that bind the domain described here as the "hormone- 

15 binding domain." Also, a ligand for an insect steroid 

receptor superfamily member is a ligand which serves 
either as the natural ligand to which the member binds, 
or a functional analogue which serves as an agonist or 
antagonist. The classical definition of "hormone" has 

20 been defined functionally by physiologists, see, e.g., 

Guyton, Textbook o f Medical Physiology, Saunders, 
Philadelphia. The functional term "hormone" is employed 
because of historic usage, but is meant to apply to other 
chemical messengers used to communicate between cell 

25 types. Recently the distinction between hormones and 

neurotransmitters has been eroded as various peptide 
neurotransmitters have been shown to exhibit properties 
of classically defined hormones. These molecules are 
typically used in intercellular signal transduction, but 

3 0 are not limited to those molecules having slow or 

systemic effects, or which act at remote sites. 

Substantial homology in the nucleic acid context 
means either that the segments, or their complementary 
strands, when compared, are identical when optimally 

35 aligned, with appropriate nucleotide insertions or 

deletions, in at least about 40% of the residues, 
generally at least about 45%, more generally at least 
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about 50%, normally at least about 55%, more normally at 
least about 60%, typically at least about 65%, more 
typically at least about 70%, usually at least about 75%, 
more usually at least about 80%, preferably at least 
5 about 85%, and more preferably at least about 95% of the 

nucleotides. Alternatively, substantial homology exists 
when the segments will hybridize under selective 
hybridization conditions, to a strand or its complement, - 
typically using a sequence derived from Table 1, 2 or 3. 

10 Selectivity of hybridization exists when hybridization 

occurs which is more selective than total lack of 
specificity. Normally, selective hybridization will 
occur when there is at least about 55% homology over a 
stretch of at least about 14 to 25 nucleotides, generally 

15 at least about 65%, typically at least about 75%, usually 

at least about 85%, preferably at least about 90%, and 
more preferably at least about 95% or more. See . 
Kanehisa, M. (1984), Nucleic Acids Res. 12:203-213, which 
is incorporated herein by reference. Stringent 

20 hybridization conditions will include salt concentrations 

of less than about 2.5 M, generally less than about 1.5 
M, typically less than about 1 M, usually less than about 
500 mM, and preferably less than about 200 mM. 
Temperature conditions will normally be greater than 

25 20°C, more normally greater than about 25 C C, generally 

greater than about 30°C, more generally greater than 
about 35"C, typically greater than about 40°C, more 
typically greater than about 45 usually greater than 
about 50 'C, more usually greater than about 55 'C, and in 

30 particular embodiments will be greater than 60 °C, even as 

high as 80 8 C or more. As other factors may significantly 
affect the stringency of hybridization, including, among 
others, base composition and size of the complementary 
strands, presence of organic solvents and extent of base 

35 mismatching, the combination of parameters is more 

important than the absolute measure of any one. 
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A gene for an insect steroid receptor superfamily 
member gene includes its upstream (e.g., promoter) and 
downstream operably linked controlling elements, as well 
as the complementary strands. See, generally, Watson et 
5 al. (1987) The Molecular Biology of the Gene , Benjamin, 

Menlo Park, which is hereby incorporated herein by 
reference. A gene geneally also comprises the segment 
encoding the transcription unit, including both introns 
and exons. Thus, an isolated gene allows for screening 
10 for new steroid receptor genes by probing for genetic 

sequences which hybridize to either controlling or 
transcribed segments of a receptor gene of the present 
invention. Three segments of particular interest are the 
controlling elements, both upstream and downstream, and 
15 segments encoding the DNA-binding segments and the 

hormone-binding segments. Methods applicable to such 
screening are analogous to those generally used in 
hybridization or affinity labeling. 

Nucleic acid probes will often be labeled using 
20 radioactive or non-radioactive labels, many of which are 

listed in the section on polypeptide labeling. Standard 
procedures for nucleic acid labeling are described, e.g., 
in Sambrook et al. (1989); and Ausubel et al. (1987 and 
supplements) . 

25 

Insect steroid receptor superfamily member polypeptides 
A polypeptide sequence of the ecdysone receptor is 
represented in Table 2. Other insect steroid receptor 
superfamily member polypeptide sequences are set forth in 

30 Tables 1 and 3. Preferred nucleic acid sequences of the 

cDNAs encoding these insect steroid receptor superfamily 
member polypeptides are also provided in the 
corresponding tables. Other nucleic acids will be used 
to encode the proteins, making use of the degeneracy or 

35 non-universality of the genetic code. 

As used herein, the term "substantially pure" 
describes a protein or other material, e.g., nucleic 
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acid, which has been separated from its native 
contaminants. Typically, a monomeric protein is 
substantially pure when at least about 60 to 75% of a 
sample exhibits a single polypeptide backbone. Minor 
variants or chemical modifications typically share the 
same polypeptide sequence. Usually a substantially pure 
protein will comprise over about 85 to 90% of a protein 
sample, and preferably will be over about 99% pure. 
Normally, purity is measured on a polyacryl amide gel, 
with homogeneity determined by staining. Alternatively, 
for certain purposes high resolution will be necessary 
and HPLC or a similar means for purification will be 
used. For most purposes, a simple chromatography column 
or polyacrylamide gel will be used to determine purity. 

The term "substantially free of naturally-associated 
insect cell components" describes a protein or other 
material, e.g., nucleic acid, which is separated from the 
native contaminants which accompany it in its natural 
insect cell state. Thus, a protein which is chemically 
synthesized or synthesized in a cellular system different 
from the insect cell from which it naturally originates 
will be free from its naturally-associated insect cell 
components. The term is used to describe insect steroid 
receptor superfamily members and nucleic acids which have 
been synthesized in mammalian cells or plant cells, E. 
coli and other procaryotes. 

The present invention also provides for analogues of 
the insect steroid receptor superfamily members. Such 
analogues include both modifications to a polypeptide 
backbone, e.g., insertions and deletions, genetic 
variants, and mutants of the polypeptides. Modifications 
include chemical derivatizations of polypeptides, such as 
acetylations , carboxylations and the like. They also 
include glycosylation modifications and processing 
variants of a typical polypeptide. These processing 
steps specifically include enzymatic modifications, such 
as ubiquitinylation. See , e.g., Hershko and Ciechanover 
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(1982), "Mechanisms of Intracellular Protein Breakdown," 
Ann. Rev. Bioch. . 51:335-364. 

Other analogues include genetic variants, both 
natural and induced. Induced mutants are derived from 
various techniques, e.g., random mutagenesis using 
reagents such as irradiation or exposure to EMS, or 
engineered changes using site-specific mutagenesis 
techniques or other techniques of modern molecular 
biology. See, e.g., Sambrook et al. (1989) Molecular 
Cloning: A Laboratory Manual (2d ed.), CSH Press; and 
Ausubel et al. (1987 and supplements) Current Protocols 
in Molecul ar Biology. Greene/Wiley, New York, each of 
which is hereby incorporated herein by reference. 

As described above, the DNA-binding zinc fingers 
segment of a receptor shows high specificity of 
recognition of specific target DNA sequences. An 
understanding of the DNA-protein binding interactions 
provides for the modification in a rational manner of 
either DNA or protein characteristics, or both, to effect 
specificity of binding for modulation of enhancer 
activity. More importantly, isolation of genes for new 
members of the insect steroid receptor superfamily allows 
their use to produce the receptor polypeptides and to 
isolate new controlling elements. By using the DNA- 
binding domains, as described above, controlling elements 
which are responsive to the ligands bound by the 
corresponding superfamily members are identified and 
isolated. This procedure shall yield a variety of 
controlling elements responsive to ligands. By the 
methods described above, the ligands for any particular 
member of the insect steroid receptor superfamily will be 
identified. 

The controlling elements typically are enhancers, 
but also include silencers or various other types of 
ligand-responsive elements. They usually operate over 
large distances, but will typically be within about 
50 kb, usually within about 35 kb, more usually within 
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about 20 kb and preferably within about 7 kb of the genes 
that these elements regulate. 

Polypeptide fragments and fusions 

Besides substantially full-length polypeptides, the 
present invention provides for biologically active 
fragments of the polypeptides. Significant biological 
activities include ligand-binding, DNA binding, 
immunological activity and other biological activities 
characteristic of steroid receptor superfamily members. 
Immunological activities include both immunogenic 
function in a target immune system, as well as sharing of 
immunological epitopes for binding, serving as either a 
competitor or substitute antigen for a steroid receptor 
epitope. 

For example, ligand-binding or DNA-binding domains 
from different polypeptides will be exchanged to form 
different or new fusion polypeptides or fragments. Thus, 
new chimaeric polypeptides exhibiting new combinations of 
specificities result from the functional linkage of 
ligand-binding specificities to DNA-binding domains. 
This is extremely useful in the design of inducible 
expression systems. 

For immunological purposes, immunogens will 
sometimes be produced from tandemly repeated polypeptide 
segments, thereby producing highly antigenic proteins. 
Alternatively, such polypeptides will serve as highly 
efficient competitors for specific binding. Production 
of antibodies to insect steroid receptor superfamily 
members is described below. 

The present invention also provides for other 
polypeptides comprising fragments of steroid receptor 
superfamily members. Fusion polypeptides between the 
steroid receptor segments and other homologous or 
heterologous proteins are provided, e.g., polypeptide 
comprising contiguous peptide sequences from different 
proteins. Homologous polypeptides will often be fusions 



WO 91/13167 



PCT/US91/01189 



27 

between different steroid receptor superfamily members, 
resulting in, for instance, a hybrid protein exhibiting 
ligand specificity of one member and DNA-binding 
specificity of another. Likewise, heterologous fusions, 
5 derived from different polypeptides, will be constructed 

which would exhibit a combination of properties or 
activities of the parental proteins. Typical examples 
are fusions of a reporter polypeptide, e.g., luciferase, 
with another domain of a receptor, e.g., a DNA-binding 

10 domain, so that the presence or location of a desired 

ligand is easily determined. See , e.g., Dull et al., 
U.S. No. 4,859,609, which is hereby incorporated herein 
by reference. Other typical gene fusion partners include 
"zinc finger" segment swapping between DNA-binding 

15 proteins, bacterial 6-galactosidase, trpE Protein A, 

6-lactamase, alpha amylase, alcohol dehydrogenase, and 
yeast alpha mating factor. See , e.g., Godowski et al. 
(1988), Science 241:812-816; and Experimental section 
below. 

20 

Insect steroid receptor superfamil v member expression 

With the sequence of the receptor polypeptides and 
the recombinant DNA sequences encoding them, large 
quantities of members of the insect steroid receptor 

25 superfamily will be prepared. By the appropriate 

expression of vectors in cells, high efficiency protein 
production will be achieved. Thereafter, standard 
protein purification methods are available, such as 
ammonium sulfate precipitation, column chromatography, 

30 electrophoresis, centrifugation, crystallization, and 

others. See, e.g., Deutscher (1990) "Guide to Protein 
Purification" Methods in Enzymology . vol 182 and others; 
and Ausubel et al. (1987 and supplements) Current 
Protocols in Molecular Biology , for techniques typically 

35 used for protein purification. Alternatively, in some 

embodiments high efficiency of production is unnecessary, 
but the presence of a known inducing protein within a 
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carefully engineered expression system is quite valuable. 
For instance, a combination of: (1) a ligand-responsive 
enhancer of this type operably linked to (2) a desired 
gene sequence with (3) the corresponding insect steroid 
5 receptor superfamily member will be placed together in an 

expression system provides a specifically inducible 
expression system. The desired gene sequence will encode 
a protein of interest, and the corresponding steroid 
receptor member will often be the ecdysone receptor. 

10 Typically, the expression system will be a cell, but in 

vitro expression systems will also be constructed. 

The desired genes will be inserted into any of a 
wide selection of expression vectors. The selection of 
an appropriate vector and cell line depends upon the 

15 constraints of the desired product. Typical expression 

vectors are described in Sambrook et al. (1989) and 
Ausubel et al. (1987 and supplements). Suitable cell 
lines are available from a depository, such as the ATCC. 
See, ATCC Catalogue of Cell Lines and Hybridomas (6th 

20 ed.) (1988); ATCC Cell Lines, Viruses, and Antisera, each 

of which is hereby incorporated herein by reference. The 
vectors are introduced to the desired cells by standard 
transformation or transfection procedures as described, 
for instance, in Sambrook et al. (1989). 

25 Fusion proteins will typically be made by either 

recombinant nucleic acid methods or by synthetic 
polypeptide methods. Techniques for nucleic acid 
manipulation are described generally, for example, in 
Sambrook et al. (1989), Molecular Cloning: A Laboratory 

3 0 Manual (2d ed.), Vols. 1-3, Cold Spring Harbor 

Laboratory, which are incorporated herein by reference. 
Techniques for synthesis of polypeptides are described, 
for example, in Merri field, J. Amer. Chem. Soc. 85:2149- 
2156 (1963). 

35 The recombinant nucleic acid sequences used to 

produce fusion proteins of the present invention will 
typically be derived from natural or synthetic sequences. 
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Many natural gene sequences are obtainable from various 
cDNA or from genomic libraries using appropriate probes. 
See , GenBank™, National Institutes of Health. Typical 
probes for steroid receptors are selected from the 
5 sequences of Tables 1, 2 or 3 in accordance with standard 

procedures. The phosphoramidite method described by 
Beaucage and Carruthers, Tetra . Letts . 22:1859-1862 
(1981) will produce suitable synthetic DNA fragments. A 
double stranded fragment is then obtainable either by 

10 synthesizing the complementary strand and annealing the 

strand together under appropriate conditions or by adding 
the complementary strand using DNA polymerase with an 
appropriate primer sequence. 

With the isolated steroid receptor genes, segments 

15 of the transcribed segments are available as probes for 

isolating homologous sequences usually from different 
sources, e.g., different animals. By selection of the 
segment used as a probe, particular functionally 
associated segments will be isolated. Thus, for example, 

20 other nucleic acid segments encoding either ligand- 

binding or DNA-binding domains of new receptors will be 
isolated. Alternatively, by using steroid-responsive 
controlling elements as a probe, new steroid-responsive 
elements will be isolated, along with the associated 

25 segment of DNA whose expression is regulated. This method 

allows for the isolation of ligand-responsive genes, many 
of which are, themselves, also members of the insect 
steroid receptor super family. 

The natural or synthetic DNA fragments coding for a 

3 0 desired steroid receptor fragment will be incorporated 

into DNA constructs capable of introduction to and 
expression in an in vitro cell culture. Usually the DNA 
constructs will be suitable for replication in a 
unicellular host, such as yeast or bacteria, but 

35 alternatively are intended for introduction to, with or 

without integration into the genome, cultured mammalian 
or plant or other eucaryotic cell lines. DNA constructs 
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prepared for introduction into bacteria or yeast will 
typically include a replication system recognized by the 
host, the intended DNA fragment encoding the desired 
receptor polypeptide, transcription and translational 
initiation regulatory sequences operably linked to the 
polypeptide encoding segment and transcriptional and 
translational termination regulatory sequences operably 
linked to the polypeptide encoding segment. The 
transcriptional regulatory sequences will typically 
include a heterologous enhancer or promoter which is 
recognized by the host. The selection of an appropriate 
promoter will depend upon the host, but promoters such as 
the trp, lac and phage promoters, tRNA promoters and 
glycolytic enzyme promoters are known. See . Sambrook et 
al. (1989) . Conveniently available expression vectors 
which include the replication system and transcriptional 
and translational regulatory sequences together with the 
insertion site for the steroid receptor DNA sequence will 
generally be employed. Examples of workable combinations 
of cell lines and expression vectors are described in 
Sambrook et al. (1989),- see also , Metzger et al. (1988), 
Nature 334:31-36. 

Genetic constructs 

The DNA segments encoding the members of the insect 
steroid receptor superfamily will typically be utilized 
in a plasmid vector. In one embodiment an expression 
control DNA sequence is operably linked to the insect 
steroid receptor superfamily member coding sequences for 
expression of the insect steroid receptor superfamily 
member alone. In a second embodiment an insect steroid 
receptor superfamily member provides the capability to 
express another protein in response to the presence of an 
insect steroid receptor ligand. This latter embodiment 
is separately described below. The expression control 
sequences will commonly include eukaryotic enhancer or 
promoter systems in vectors capable of transforming or 
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trans feet ing eucaryotic host cells. Once the vector has 
been introduced into the appropriate host, the host, 
depending on the use, will be maintained under conditions 
suitable for high level expression of the nucleotide 
5 sequences . 

Steroid-responsive expression of selected genes 

For steroid-responsive expression of other genes, 
the steroid receptor gene will typically be cotransformed 

10 with a recombinant construct comprising a desired gene 

for expression operably linked to the steroid-responsive 
enhancer or promoter element. In this use, a single 
expression system will typically comprise a combination 
of (1) a controlling element responsive to a ligand of an 

15 insect steroid receptor superfamily member, (2) a desired 

gene for expression, operably linked to the controlling 
element, and (3) an insect steroid receptor superfamily 
member which can bind to the controlling element. 
Usually, this system will be employed within a cell, but 

20 an in vitro system is also possible. The insect steroid 

receptor superfamily member will typically be provided by 
expression of a nucleic acid encoding it, though it need 
not be expressed at high levels. Thus, in one preferred 
embodiment, the system will be achieved through 

25 cotrans formation of a cell with both the regulatable 

construct and another segment encoding the insect steroid 
receptor superfamily member. Usually, the controlling 
element will be an enhancer element, but some elements 
work to repress expression. In this embodiment, the 

3 0 ligand for the insect steroid receptor superfamily member 

will be provided or withheld as appropriate for the 
desired expression properties. 

A particularly useful genetic construct comprises an 
alcohol dehydrogenase promoter operably linked to an 

35 easily assayable reporter gene, e.g., 6-galactosidase. 

In a preferred embodiment of this construct, a 
multiplicity of copies of the insect steroid receptor 
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superfamily member is used. For example, operable 
linkage of controlling elements responsive to insect 
steroid receptor superfamily members, e,g., EcR, DHR3, 
E75A and E75B, to the alcohol dehydrogenase (ADH) 
promoter, or others as described above, and protein 
coding sequences for a particular reporter protein, as 
described above, leads to steroid-responsive expression 
of B-galactosidase. Such a system provides highly 
sensitive detection of expression in response to ligand 
binding, allowing for detection of a productive ligand- 
receptor interaction. 

DNA sequences will normally be expressed in hosts 
after the sequences have been operably linked to (i.e., 
positioned to ensure the functioning of) an expression 
control sequence. These expression vectors are typically 
replicable in the host organisms either as episomes or as 
an integral part of the host chromosomal DNA. Commonly, 
expression vectors will contain selection markers, e.g., 
tetracycline or neomycin, to permit detection of those 
cells transformed with the desired DNA sequences ( see , 
e.g., U.S. Patent 4,704,362, which is incorporated herein 
by reference) . 

E. coli is one procaryotic host useful for cloning 
the DNA sequences of the present invention. Other 
microbial hosts suitable for use include bacilli, such as 
Bacillus subtilis r and other enterobacteriaceae, such as 
Salmonella . Serratia, and various Pseudomonas species. 

Other eucaryotic cells will often be used, including 
yeast cells, insect tissue culture cells, avian cells, or 
the like. Preferably, mammalian tissue cell culture will 
be used to produce the inducible polypeptides of the 
present invention ( see , Winnacker, From Genes to Clones . 
VCH Publishers, N.Y. (1987), which is incorporated herein 
by reference) . Mammalian cells are preferred cells in 
which to use the insect steroid receptor superfamily 
member ligand-responsive gene constructs, because they 
naturally lack the molecules which confer responses to 
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the ligands for insect steroid receptor superf amily 
members . 

Mammalian cells are preferred because they are 
insensitive to many ligands for insect steroid receptor 
5 superf amily member. Thus, exposure of these cells to the 

ligands of the insect steroid receptor superf amily 
members typically will have negligible physiological or 
other effects on the cells, or on a whole organism. 
Therefore, cells can grow and express the desired 

10 product, substantially unaffected by the presence of the 

ligand itself. The ligand will function to cause 
response either in the positive or negative direction. 
For example, it is often desirable to grow cells to high 
density before expression. In a positive induction 

15 system, the inducing ligand would be added upon reaching 

high cell density, but since the ligand itself is benign 
to the cells, the only physiological imbalances result 
from the expression, e.g., the product, itself. 
Alternatively, in a negative repression system, the 

20 ligand is supplied until the cells reach a high density. 

Upon reaching a high density, the ligand would be 
removed. Introduction of these cells into a whole 
organism, e.g., a plant or animal, will provide the 
products of expression to that organism. In this 

25 circumstance, the natural insensitivity of cells to the 

ligands will also be advantageous. 

Expression vectors for these cells can include 
expression control sequences, such as an origin of 
replication, a promoter, an enhancer and necessary 

30 processing information sites, such as ribosome-binding 

sites, RNA splice sites, polyadenylation sites, and 
transcriptional terminator sequences. Preferably, the 
enhancers or promoters will be those naturally associated 
with genes encoding the steroid receptors, although it 

35 will be understood that in many cases others will be 

equally or more appropriate. Other preferred expression 
control sequences are enhancers or promoters derived from 
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viruses, such as SV40, Adenovirus, Bovine Papilloma 
Virus, and the like. 

Similarly, preferred promoters are those found 
naturally in immunoglobul in-producing cells ( see . U.S. 
5 Patent No. 4,663,281, which is incorporated herein by 

reference) , but SV40, polyoma virus, cytomegalovirus 
(human or murine) and the LTR from various retroviruses 
(such as murine leukemia virus, murine or Rous sarcoma 
virus and HIV) are also available. See . Enhancers and 

10 Eukarvotic Gene Expression . Cold Spring Harbor Press, 

N.Y., 1983, which is incorporated herein by reference. 

The vectors containing the DNA segments of interest 
(e.g., the steroid receptor gene, the recombinant 
steroid-responsive gene, or both) can be transferred into 

15 the host cell by well-known methods, which vary depending 

on the type of cellular host. For example, calcium 
chloride transfection is commonly utilized for 
procaryotic cells, whereas calcium phosphate treatment is 
often used for other cellular hosts. See, generally, 

20 Sambrook et al. (1989), Molecular Cloning; A Laboratory 

Manual (2d ed.), Cold Spring Harbor Press; Ausubel et al. 
(1987 and supplements) Cement Protocls in Molecular 
Biology . Greene/Wiley, New York; and Potrykus (1990) 
"Gene Transfer to Cereals: An Assessment," 

25 Bio /Technology 8:535-542, each of which is incorporated 

herein by reference. Other transformation techniques 
include electroporation, DEAE-dextran, microprojectile 
bombardment, lipofection, microinjection, and others. 
The term "transformed cell" is meant to also include the 

3 0 progeny of a transformed cell. 

As with the purified polypeptides, the nucleic acid 
segments associated with the ligand-binding segment and 
the DNA-binding segment are particularly useful. These 
gene segments will be used as probes for screening for 

35 new genes exhibiting similar biological activities, 

though the controlling elements of these genes are of 
equal importance, as described below. 
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Many types of proteins are preferentially produced 
in eucaryotic cell types because of abnormal processing 
or modification in other cell types. Thus, mammalian 
proteins are preferably expressed in mammalian cell 
cultures. Efficient expression of a desired protein is 
often achieved, as described above, by placing: (1) a 
desired protein encoding DNA sequence adjacent to 
controlling elements responsive to ligands for insect 
steroid receptor superfamily members and an appropriate 
promoter. Cyclic pulses of ligands in a cell culture may 
provide periods for cells to recover from effects of 
production of large amounts of exogenous protein. Upon 
recovery, the ligand will often be reinduced. 

Additional steroid responsive gene elements have 
also been isolated, e.g., substantially purified, using 
the techniques of the present invention. Other genes 
adjacent to, and operably linked to, steroid responsive 
gene control elements are selectable by locating DNA 
segments to which steroid receptors specifically bind or 
by hybridization to homologous controlling elements. For 
example, other steroid responsive genes have been 
isolated. Many of the genes which are ligand-responsive 
will also be new members of the insect steroid receptor 
superfamily. 

Having provided for the substantially pure 
polypeptides, biologically active fragments thereof and 
recombinant nucleic acids comprising genes for them, the 
present invention also provides cells comprising each of 
them. By appropriate introduction techniques well known 
in the field, cells comprising them will be produced. 
See , e.g., Sambrook et al. (1989). 

In particular, cells comprising the steroid 
responsive controlling elements are provided, and 
operable linkage of standard protein encoding segments to 
said controlling elements produce steroid responsive 
systems for gene expression. Cells so produced will 
often be part of, or be introduced into, intact 
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organisms, for example, plants, insects (including 
caterpillars and larvae), and higher animals, e.g., 
mammals. This provides for regulatable expression of 
desired genes where the regulating ligand has no other 
effects on the cells because the cells otherwise lack the 
receptors and responsive genes. For example, plants will 
be induced to fruit at desired times by administration of 
the appropriate ligand, or animals will be ligand - 
responsive in production of particular products. And, in 
fact, biochemical deficiencies may be overcome by ligand- 
responsive expression of cells introduced into an intact 
organism which, itself, also otherwise lacks genes 
responsive to the presence of such a ligand. Multiple 
repeats of the control elements will lead, often, to at 
least additive or synergistic control. Cells containing 
these expression systems will be used in gene therapy 
procedures, including in humans. 

Once a sufficient quantity of the desired steroid 
receptor polypeptide has been obtained, the protein is 
useful for many purposes. A typical use is the 
production of antibodies specific for binding to steroid 
receptors. These antibodies, either polyclonal or 
monoclonal, will be produced by available in vitro or in 
vivo techniques. 

For production of polyclonal antibodies, an 
appropriate target immune system is selected, typically a 
mouse or rabbit. The substantially purified antigen is 
presented to the immune system in a fashion determined by 
methods appropriate for the animal and other parameters 
well known to immunologists . Typical sites for injection 
are in the footpads, intramuscularly, intraperitoneal ly, 
or intradermally . Of course, another species will often 
be substituted for a mouse or rabbit. 

An immunological response is usually assayed with an 
immunoassay. Normally such immunoassays involve some 
purification of a source of antigen, for example, 
produced by the same cells and in the same fashion as the 
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antigen was produced. The immunoassay will generally be 
a radioimmunoassay, an enzyme-linked assay (ELISA) , a 
fluorescent assay, or any of many other choices, most of 
which are functionally equivalent but each will exhibit 
5 advantages under specific conditions. 

Monoclonal antibodies with affinities of 10 8 M -1 
preferably 10 9 to 10 10 , or stronger will typically be 
made by standard procedures as described, e.g., in Harlow 
and Lane (1988), Antibodies; A Lab oratory Manual, Cold 

10 Spring Harbor Laboratory; or Goding (1986) , Monoclonal 

Antibodies; Principles and Practice (2d ed) Academic 
Press, New York, which are hereby incorporated herein by 
reference. Briefly, appropriate animals will be selected 
and the desired immunization protocol followed. After 

15 the appropriate period of time, the spleens of such 

animals are excised and individual spleen cells fused, 
typically, to immortalized myeloma cells under 
appropriate selection conditions. Thereafter the cells 
are clonally separated and the supernatants of each clone 

20 are tested for their production of an appropriate 

antibody specific for the desired region of the antigen. 

Other suitable techniques involve in vitro exposure 
of lymphocytes to the antigenic polypeptides or 
alternatively to selection of libraries of antibodies in 

25 phage or similar vectors. See , Huse et al., (1989) 

"Generation of a Large Combinatorial Library of the 
Immunoglobulin Repertoire in Phage Lambda," Science 
246:1275-1281, hereby incorporated herein by reference. 
The polypeptides and antibodies of the present 

3 0 invention will be used with or without modification. 

Frequently, the polypeptides and antibodies will be 
labeled by joining, either covalently or non-covalently , 
a substance which provides for a detectable signal. A 
wide variety of labels and conjugation techniques are 

3 5 known and are reported extensively in both the scientific 

and patent literature. Suitable labels include 
radionuclides, enzymes, substrates, cof actors, 
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inhibitors, fluorescens, chemiluminescers , magnetic 
particles and the like. Patents, teaching the use of 
such labels include U.S. Patent Nos. 3,817,837; 
3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; 
and 4,366,241. Also, recombinant, chimeric, or 
humanized, immunoglobulins will be produced, see , e.g., 
Cabilly, U.S. Patent No. 4,816,567; Jones et al., 1986, 
Nature 321, 522-526; and published UK patent application 
No. 8707252; each of which is hereby incorporated herein 
by reference. 

Another use of purified receptor polypeptides is for 
determination of the structural and biosynthetic aspects 
of the polypeptides. Structural studies of interactions 
of the ligand-binding domains with selected ligands are 
performed by various methods. The preferred method for 
structural determination is X-ray crystallography but may 
include various other forms of spectroscopy or 
chromatography . See , e.g., Connol ly , M.L. , J. Appl. 
Crystal!^, 16:548 (1983); Connolly, M.L., Science 221:709 
(1983) ; and Blundell and Johnson (1976) Protein 
Crystallography f Academic Press, New York; each of which 
is hereby incorporated herein by reference. For example, 
the structure of the interaction between hormone ligand 
and hormone-binding segments is determined to high 
resolution. From this information, minor substitutions 
or modifications to either or both of the ligand and 
ligand-binding segment are made based upon, e.g., the 
contact regions between the two. This information 
enables the generation of modified interactions between a 
ligand and its binding segment to either increase or 
decrease affinity of binding and perhaps increase or 
decrease response to binding. Likewise, the interaction 
between the zinc finger DNA-binding segments with the 
specific nucleic acid-binding sequence will be similarly 
modified. 

As a separate and additional approach, isolated 
ligand-binding polypeptide domains will be utilized to 
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screen for new ligands. Binding assays will be 
developed, analogous, e.g., to immunoassays. This 
procedure permits screening for new agonists or 
antagonists of a particular steroid receptor. Isolated 
5 DNA-binding segments will be used to screen for new DNA 

sequences which will specifically bind to a particular 
receptor-binding segment. Typically, these receptor- 
specific binding sites will be controlling elements for 
steroid responsive genes. Thus, having isolated these 
10 DNA-binding sequences, genes which are responsive to the 

binding of a given receptor can be isolated. This 
provides a method for isolating genes which are 
responsive to induction or inhibition by a given hormone 
receptor . 

15 In another aspect of the present invention, means 

for disrupting insect development are provided where new 
ligand agonists or antagonists are discovered. These 
compounds are prime candidates as agonists or antagonists 
to interfere with normal insect development. By 

20 application of new steroid analogues of ligands for 

insect steroid receptor superfamily members, it is 
possible to modify the normal temporal sequence of 
developmental events. For example, accelerating insect 
development will minimize generation time. This will be 

25 very important in circumstances where large numbers of 

insects are desired finally, for instance, in producing 
sterile males in Mediterranean fly infestations. 
Alternatively, it is useful to slow development in some 
pest infestations, such that the insects reach 

30 destructive stages of development only after commercial 

crops have passed sensitive stages. 

In another commercial application, ligands 
discovered by methods provided by the present invention 
will be used in the silk-production industry. Here, the 

35 silkworms are artificially maintained in a silk-producing 

larvae stage, thereby being silk productive for extended 
time periods. The development of larvae will also be 
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susceptible to acceleration to silk-production in a 
shorter time period than than naturally. 

Other analogues of ligands for insect steroid 
receptor super family members will be selected which, upon 
application, will completely disrupt normal development 
and, preferably, lead to a lethal result. Slightly 
modified natural substances will often have greater 
specificity of action and much higher activities, 
allowing for lower levels of application. For example, 
more lipophilic ligands are more readily absorbed 
directly into the insect surface or cuticle. Thus, 
extremely low concentrations of natural ligands should be 
effective in controlling pests. Furthermore, many of 
these ligands are likely to be relatively easily 
manufactured, taking advantage of enzymatic production 
methods. New ligands for insect steroid receptor 
superfamily members will sometimes be more species 
specific or will exhibit particularly useful 
characteristics, for example, being lethal to specific 
harmful insects. The greater specificity of the hormones 
will allow avoidance of the use of non-specific 
pesticides possessing undesired deleterious ecological 
side effects, e.g., pesticide residue accumulation in 
food, often having deleterious effects on humans. 
Furthermore, compounds having structures closely 
analogous to natural compounds should be susceptible to 
natural mechanisms of biological degradation. 

Another aspect of the present invention provides for 
the isolation or design of new gene segments which are 
responsive to ligands for insect steroid receptor 
superfamily members. For example, use of the nucleic 
acids to screen for homologous sequences by standard 
techniques will provide genes having similar structural 
features. Similarly arranged intron structures will 
typically be characteristic of larger superfamily 
categories. The preferred domains for screening will be 
the ligand -binding or DNA-binding segments, although the 
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DNA segments which are recognized by the DNA-binding 
domains, i.e., the controlling elements, will also be of 
particular interest. Screening for new controlling 
elements will usually take advantage of known 
5 similarities, e.g., sequence homology to other known 

elements, or homology to the DNA zinc finger-binding 
domains of other receptors. Receptors and genes 
important in the general developmental sequence of 
expression will be discovered. Using this set of 
10 developmental ly regulated genes will allow selection of 

particular molecules which are responsible for 
controlling expression of developmentally regulated 
genes . 

Kits for the determination of expression levels of 
15 the nucleic acids and proteins provided herein are made 

available. Typically, the kit will have at least one 
compartment which contains a reagent which specifically 
binds to the desired target molecule, e.g., ligand 
analogues, receptors, or nucleic acids. These reagents 
20 will be used in techniques for assays, e.g., using 

methods typically used in screening protocols. See, 
e.g., Sambrook et al. (1989) and Ausubel et al. (1987 and 
supplements) . 

The following experimental section is offered by way 
25 of example and not by limitation. 

EXPERIMENTAL 
EXAMPLE I 

CLONING STRUCTURE AND EXPRESSION OF THE DROSOPHILA 
30 E75 GENE THAT ENCODES TWO MEMBERS OF THE STEROID 

RECEPTOR SUPERFAMILY. 

The following experiments demonstrate that the E75 
gene encodes two members of the steroid receptor 
35 super family. The proteins it encodes share amino acid 

sequence homology with the conserved DNA-binding and 
ligand-binding domains of this superfamily. The E75 gene 
is ecdysone-inducible, and it occupies and causes the 
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ecdysone-inducible early puff at the 75B locus in the 
Drosophila polytene chromosome. 

A. Cloning of Genomic DNA Encompassing the 
Ecdysone-inducible 75B Puff Locus 

We have used the method of chromosomal walking 
(Bender, W. , P. Spierer, and D. S. Hogness, 1983. 
Chromosomal walking and jumping to isolate DNA from the 
Ace and rosy loci and the Bithorax complex in Drosophila 
melanoaaster. J. Mol. Biol. 168:17-33) to isolate the 
genomic DNA encompassing the 75B puff region. The 
starting point for the walk was a genomic clone, 
designated A8253, which had been localized by in situ 
hybridization to the proximal end of 75B. Isolated 
restriction fragments of A8253 were used to screen a 
library of genomic DNA from the Canton S (C s ) strain of 
D. melanocraster . See (Maniatis, T., R. C. Hardison, E. 
Lacy, J. Lauer, C. 0 , Connell, D. Quon, G. K. Sim, and 
A. Efstradiatis, 1978, "The isolation of structural genes 
from libraries of eucaryotic DNA. " Cell 15:687-701). 
Genomic clones AcDm3504 and AcDm3505 were isolated by 
homology to A8253. 

The walk was then extended in both directions until 
-100 kb of genomic DNA had been isolated, and the 
orientation of the walk was determined by in situ 
hybridization of the terminal segments to polytene 
chromosomes. Thereafter, the walk was extended in the 
rightward direction on the molecular map, or distally 
relative to the centromere. The 350 kb of genomic DNA 
encompassed by the walk corresponds to the chromosomal 
region between bands 75A6-7 and 75B11-13, as determined 
by in situ hybridization. This region includes the 75B 
puff, which appears to initiate by simultaneous 
decondensation of chromosomal bands 75B3-5 and then 
spreads to surrounding bands. 



Methods 

Genomic DNA libraries 
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Canton S genomic DNAs were isolated from a library 
of sheared, EcoRI-linkered Canton S DNA cloned into the 
Charon 4 A phage vector. See (Maniatis, T. , 
R. C. Hardison, E. Lacy, J. Lauer, C. O'Connell, D. Quon, 
5 G. K. Sim, and A. Ef stradiatis, 1978. The isolation of 

structural genes from libraries of eucaryotic DNA. Cell 
15:687-701). 0 r genomic DNAs were isolated from a library 
of sheared DNA, GC-tailed into the sep6 A vector. See 
(Meyerowitz, F. M. , and D. S. Hogness, 1982. "Molecular 

10 organization of a Drosophila puff site that responds to 

ecdysone." Cell 28:165-176). One step in the 
chromosomal walk was taken using a cosmid library of 
SauIIIA partially digested 0 r DNA cloned into the cosmid 
pl4Bl by the method of Ish-Horowicz and Burke (Ish- 

15 Horowicz, D., and J. F. Burke, 1982. Rapid and efficient 

cosmid cloning. Nucleic Acids Res. 9:2989-2998). 

In situ hybridization 

In situ hybridization to polytene chromosomes was 

20 carried out with DNA probes that were nick-translated in 

the presence of 3 H-labeled TTP (NEN) , as described by 
Bonner and Pardue (Bonner, J. J., and M. L. Pardue, 1976. 
Ecdysone-stimulated RNA synthesis in imaginal discs of 
Drosophila melanoaaster . Assay by in situ hybridization. 

25 Chromosoma 58:87-99), with the following modifications: 

Heat and RNAase treatments of the slides were omitted, 
and hybridization and washing were at 63 "C in 2XSSPE for 
18 and 2 hours, respectively. 

30 B. Identification of a 50 kb Region of Cloned 

Genomic DNA that Contains Sequences Homologous 
to Ecdvsone- induce d Transcripts 

Restriction fragments of the above genomic clones 
35 were tested for their ability to hybridize with each of 

two cDNA probes, one derived from the RNA in ecdysone- 
induced cells, and the other from the RNA in noninduced 
cells. Two differential screens were carried out. In 
the first, genomic DNA covering the entire 350 kb walk 
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was examined with cDNA probes synthesized with reverse 
transcriptase from an oligo(dT) primer annealed to 
poly (A) + RNA. The poly (A) + RNA was prepared from total 
inner tissues that were mass-isolated from late third 
instar larvae and incubated in the presence of ecdysone 
plus cycloheximide, or cycloheximide alone. (See 
Methods, below. Cycloheximide was included because 
higher levels of ecdysone- induced transcripts accumulate 
in its presence . ) 

Each of the 32 P-labeled cDNA probes made from these 
two poly (A) + RNAs was applied to one of two duplicate 
Southern blots that contained, in addition to the genomic 
fragments from the walk, a control DNA consisting of 
sequences from the ribosomal protein 49 gene (O'Connell, 
P., and M. Rosbash, 1984. Sequence, structure and codon 
preference of the Drosophila ribosomal protein 49 gene. 
Nucleic Acids Res. 12:5495-5513), which was used to 
normalize the hybridization intensities of the duplicate 
blots. This screen revealed sequences specific to 
ecdysone-induced RNAs only within the AcDm3522 genomic 
clone that is centered at approximately +220 kb on the 
molecular map. 

Because the above probes will preferentially detect 
sequences near the 3' termini of the RNAs, particularly 
in the case of long transcripts, a second differential 
screen was carried out with cDNA probes primed with 
random hexamers (see Methods, below) . This screen, which 
was restricted to the 135 kb of genomic DNA between +105 
kb and +240 kb, revealed ecdysone- inducible sequences in 
fragments spread out over an -50 kb region between +170 
kb and +22 0 kb. This region represents the E75 gene. 



Methods 

Organ culture and RNA isolation 
35 Late third instar O r larvae were harvested, washed in 

0.7% NaCl, resuspended in Robb's phosphate-buffered 
saline (PBS) (Robb, J. A., 1968. Maintenance of imaginal 
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discs of Drosophila melanocraster in chemically defined 
media. J . Cell. Biol. 41:876-885), preaerated with a 
blender, and passed through a set of rollers to extrude 
the organs. This "grindate" was filtered through a 
5 coarse Nitex screen to remove carcasses, and settled five 

times (3-5 minutes per settling) by gravity to remove 
floating and microscopic debris. Isolated tissues 
(primarily salivary glands, imaginal discs, gut, and 
Malphigian tubules) were cultured at 25 °C in plastic 

10 petri dishes in aerated Robb's PBS. B-ecdysone (Sigma) 

(0.2 Ml/^1 °f 1° mg/ml) in ethanol and/or cycloheximide 
(2 Ml/ml of 3 5 mM) in water was added to the appropriate 
cultures. Incubations in the presence of cycloheximide 
were for -8 hours. Isolated tissues were homogenized in 

15 10 volumes of 6 M guanidine-HCl/O . 6 M sodium acetate 

(pH 5.2), centrifuged at 5000 g for 10 minutes to remove 
debris, and layered onto a 5.7 M CaCl shelf, as described 
previously (Chirgwin, J. M. , A. E. Przbyla, 
R. J. MacDonald, and W. J. Rutter, 1979. Isolation of 

20 biologically active ribonucleic acid from sources 

enriched in ribonuclease. Biochemistry 18:5294-5299). 
Poly (A) + RNA was purified by oligo(dT) chromatography. 



Southern blot analysis 

25 Southern blots were performed on nitrocellulose, as 

described previously (Segraves, W. A., C. Louis, S. 
Tsubota, P. Schedl, J. M. Rawls, and B. P. Jarry, 1984. 
The rudimentary locus of Drosophila melanoaaster . J\ 
Mol. Biol. 175:1-17). cDNA probes were prepared by 

3 0 reverse transcription (AMV reverse transcriptase; 

Seikagaku) of 2 ng of poly (A) + RNA with 700 ng of 
oligo(dT) 12 " 16 (Collaborative Research) or 15 ng of random 
hexamers (Pharmacia) in a 20 fxl reaction mixture 
containing 80 mM Tris CI (pH 8.3 at 42 e C), 10 mM MgCl 2 , 

35 100 mM KC1, 0.4 mM DTT, 0.25 mM each of dATP, dGTP, and 

dTTP, and 100 nCi of [ 32 P]dCTP (800 Ci/mole; Amersham) . 
After incubation at 37°C for 45 minutes, 80 /il of 10 mM 
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EDTA and 2 fil of 5 H NaOH were added before incubation at 
70"C for 10 minutes to denature the products and 
hydrolyze the RNA. After the addition of 10 pi of 1 M 
Tris-Cl (pH 7.5) and 5 jtil of 1 N HC1, unincorporated 
5 label was removed by chromatography on Biogel P60. 

C. The E75 Gene Contains Two Overlapping 
Transcription Units; E75A and E75B 

10 Northern blot analysis of ecdysone-induced and 

noninduced RNAs, prepared as described above and 
hybridized with strand-specific DNA probes derived from 
cloned restriction fragments in the 60 kb region (+166 to 
+226 kb) containing the E75 gene, demonstrated that this 

15 gene produces two classes of ecdysone-inducible mRNAs , 

both derived from rightward transcription. The E75A 
class of mRNAs hybridized with probes from both the 5' 
(left) and 3' (right) ends of the 50 kb E75 gene. The 
E75B class hybridized only with probes from the 3'- 

20 proximal 20 kb of the gene. These results suggest that 

the A and B classes of ecdysone-inducible RNAs are 
initiated by different promoters, located about 30 kb 
apart, and that the two transcription units defined by 
these promoters overlap in the region downstream from the 

25 B promoter. 

This suggestion was confirmed by analysis of the 
structure of cloned cDNAs from the E75A and E75B mRNAs. 
Approximately 10 6 clones from an early pupal cDNA library 
(Poole, S. J., L. M. Kauvar, B. Drees, and T. Kornberg, 

30 1985. The engrailed locus of Drosophila ; Structural 

analysis of an embryonic transcript. Cell 40:37-40) were 
screened at low resolution with genomic DNA probes from 
the E75 gene region. The 116 cDNA clones identified by 
this screen were analyzed by restriction digestion and 

35 hybridization to a panel of probes derived from the 60 kb 

(+166 to +226 kb) region. One of the clones, ADm4925, 
was thereby selected as a representative of the E75A 



WO 91/13167 



PCI7US91/01189 



47 

class of mRNAs, and another, ADm4745, as a representative 
of the E75B mRNA class. 

The genomic regions homologous to these two cDNA 
clones were further localized by Southern blot analysis, 
5 and the nucleotide sequence of these regions and of both 

cDNA clones was determined. These sequences are given in 
Table 1, along with those derived from 5' and 3' terminal 
sequence determinations for each transcription unit. 
These data demonstrate that the 50 kb E75A transcription 

10 unit consists of six exons, labeled in 5* to 3' order: 

AO, Al, 2, 3, 4 and 5, of which exons AO and Al are 
specific to this unit, while the remaining four are 
shared with the 20 kb E75B transcription unit. 
Similarly, the E75B unit contains a specific exon, 

15 labeled Bl, at its 5' end, which is located just upstream 

of the shared exon 2. Thus, the E75 gene consists of two 
transcription units, of which the shorter E75B unit 
occupies the 3' proximal 20 kb of the longer E75A unit. 
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Table 1. Sequences of the E75 exons and 
flanking DNA. The sequence is that of the C s genomic 
DNA, which was identical to that of the cDNAs, except for 
the T-+G change indicated at position +2691. This change 
would convert a leucine to an arginine in the protein 
sequences. The Dm4925 cDNA extends from just 5' of the 
EcoRV site at +939 to +4267 in A. The Dm4745 cDNA 
extends from +804 in B to a point near the Hindlll site 
at +4246 in A. (A) The E75 A exons and flanking DNA. The 
sequences of the AO, Al, and common exons 2-5 are 
interrupted by intron sequences (lowercase) , which are 
limited to those near the splice sites and are in 
agreement with consensus sequences for donor (5') and 
acceptor (3') splice sites. Negative numbers at the 
right end of each line refer to the number of base pairs 
upstream of the E75 A initiation site, positive numbers 
refer to positions in the E75 A mRNAs, continuing into 
the 3* flanking DNA. Numbers at the left end of each 
line refer to amino acid residues in the E75 A protein. 
The underlined 14 bp sequence at -159 to -172 exhibits a 
13/14 bp match to a sequence (CGTAGCGGGTCTC) found 47 bp 
upstream of the ecdysone-inducible E74 A transcription 
unit responsible for the early puff at 74EF. This 
sequence represents the proximal part of a 19 bp sequence 
in the E74 A promoter that binds the protein encoded by 
the D. melanoaaster zeste gene. Another underlined 
sequence in the E75 A promoter at -74 to -82 is also 
found in the E75 B promoter, where it is part of a 
tandemly repeated octanucleotide (GAGAGAGC) located at 
-106 to -121 in B. This repeat matches the consensus 
sequence for the binding sites of the GAGA transcription 
factor which also binds to the E74 A promoter. Other 
underlined sequences represent, at -27 to -33, the best 
match to the TATA box consensus at an appropriate 
position, three AUG codons that are closely followed by 
in-frame stop codons in the 5 '-leader sequence of the E75 
mRNAs, and alternative polyadenylation-cleavage signals 
at 4591 and 5365 that are used by both E75 A and E75B 
mRNAs . (B) The Bl exon and its 5 '-flanking DNA. The 
numbering at the right and left ends of the lines follows 
the same convention as in A. Exons 2-5 shown in A are 
also used in E75 B, but the amino acid residues and base 
pair numbers shown in A must be increased by 157 and 375, 
respectively, to apply to the E75 B protein and mRNA. 
The first ten nucleotides of the 136-nucleotide E75 
B- intron linking the Bl exon to Exon 2 are gtaggttag, 
whereas the last ten are shown upstream of nucleotide 
1178 in A. The underlined sequences represent, in order, 
the region of homology to a sequence upstream of E75 A, 
noted above, the best match to the TATA box consensus 
at -21 to -27, and three AUG codons followed by in-frame 
stop codons in the 5' leader of the E75 B mRNA. 

Panels 1 and 2 are shown in detail in panels 3-8, 
and 9-10, respectively. 
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Methods 

cDNA libraries 

The ADm4925 and ADm4745 cDNAs were isolated from an 
5 0 r early pupal cDNA library in AgtlO (Poole, S. J., 

L. M. Kauvar, B. Drees, and T. Romberg, 1985. The 
engrailed locus of Drosophila : structural analysis of an 
embryonic transcript. Cell 40:37-40). The two cDNAs 
(ADm4927 and ADm4928) that were used for 3'-end mapping 
10 were isolated from an ecdysone-induced salivary gland 

cDNA library in A607 prepared by C. W. Jones. (Our 
strain collection names for the cDNA clones used in these 
studies are AfDm4925, AfDm4745, AeDm4927, and AeDm4928.) 

15 Northern blot analysis 

Probes to be used for Northern blots were cloned 
into the vector p0X (from R. Mulligan) , containing the 
0X174 origin of replication cloned in between the Hindlll 
and BamHI sites of pBR322. This allowed the synthesis of 

20 single-stranded probe DNA (Arai, K. , N. Arai, J. 

Schlomai, and A. Romberg, 1980. Replication of duplex 
DNA of phage 0X174 reconstituted with purified enzymes. 
Proc. Natl. Acad. Sci. 77:3322-3326), which was performed 
by the incubation of .supercoiled plasmid DNA with gene A 

25 protein . rep and ssb proteins, and DNA polymerase III 

holoenzyme in a reaction containing 20 mM Tris CI 
(pH 7.5), 80 Mg/ml BSA, 4% glycerol, 20 mM DTT, 1 mM ATP, 
16 mM concentrations of the three unlabeled 
deoxynucleotides and 1.6 mM concentrations of the labeled 

3 0 deoxynucleotide for 1 hour at 30 °C. EDTA was then added 

to 20 mM, SDS to 0.1%, and proteinase K to 50 jug/ml. The 
reactions were digested for 3 0 minutes at 37"C, and 
unincorporated label was removed by gel filtration. 



35 51 nuclease protection and primer extension analysis 

Single-stranded probes, prepared as described above 
by the 0X in vitro replication system, were purified by 
electrophoresis on low melting point agarose gels for use 
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as Si probes. All other probes were prepared by 
extension of the -20, 17-mer sequencing primer (New 
England Biolabs) on single-stranded M13mp (Messing, J., 
1983. New M13 vectors for cloning. Methods Enzymol. 
5 101:20-78) or pEMBL (Dente, L. , G. Cesareni, and R. 

Cortes, 1983. pEMBL: A new family of single-stranded 
plasmids. Nucleic Acids Res. 11:1645-1654) recombinant 
templates using 32 P-labeled nucleotides, followed by 
cleavage with the appropriate restriction enzyme and 

10 purification of the probe on denaturing polyacrylamide 

gels. Labeled probe (100,000-300,000 cpm) was incubated 
with l jug of poly (A) + RNA in a 5 /tl reaction mixture 
containing 5 \iq of yeast tRNA, 0.4 M NaCl, 40 mM PIPES 
(pH 6.8), and 1 mM EDTA at 60 *C under oil. Reactions 

15 were cooled and diluted 1:10 into either SI digestion or 

primer extension buffer. SI nuclease digestions were 
performed in 50 mM acetate buffer (Na) , 400 mM NaCl, and 
4 mM ZnSC-4 at 20 'C for 1 hour with -15 - 150 Vogt units of 
SI nuclease (Boehringer) per 50 jul reaction. Primer 

20 extensions were performed at 42 *C in 50 mM Tris CI 

(pH 8.3 at 42°C), 80 mM KC1, 2 mM DTT, 1 mM of dATP, 
dCTP, dGTP, and dTTP, with 20 units of AMV reverse 
transcriptase (Seikagaku) per 50 \i\ reaction. Reactions 
were terminated by the addition of EDTA, tRNA carrier was 

25 added to the SI nuclease digestions, and samples were 

ethanol-precipitated and either electrophoresed directly 
on 5% or 6% denaturing polyacrylamide gels or glyoxalated 
(McMaster, G. K. , and G. C. Carmichael, 1977. Analysis 
of single and double-stranded nucleic acids on 

30 polyacrylamide and agarose gels by using glyoxal and 

acridine orange. Proc. Natl. Acad. Sci. 74:4835-4838) 
and electrophoresed on 1% agarose gels run in 10 mM 
sodium phosphate buffer (pH 6.8). 



35 



DNA sequence analysis 

The cDNA clones ADm4927 and ADm4928 were sequenced 
by chemical degradation (Maxam, A. M. , and W. Gilbert, 
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198 0. Sequencing end-labeled DNA with base-specific 
chemical cleavage. Methods Bnzymol. 65:499-560) . All 
other sequencing was performed using the 
dideoxynucleotide chain termination method (Sanger, F. , 
5 A. R. Coulson, B. F. Barrell, A. J. H. Smith, and 

B. A. Roe, 1980. Cloning in single-stranded 
bacteriophage as an aid to rapid DNA sequencing. J. Mol. 
Biol . 143:161-178). Fragments were cloned into M13mp 
vectors (Messing, J., 1983. New M13 vectors for cloning. 

10 Methods Enzvmol. 101:20-78) or pEMBL (Dente, L. , 

G. Cesareni, and R. Cortes, 1983. pEMBL: A new family of 
single-stranded plasmids. Nucleic Acids Res. 11:1645- 
1654) and sequenced directly or following the generation 
of a set of overlapping deletions using exonuclease III 

15 (Henikoff, S., 1984. Unidirectional digestion with 

exonuclease III creates targeted breakpoints for DNA 
sequencing. Gene 28:351-359) . Sequencing was performed 
on both strands of the ADm4925 cDNA, the B-specific 
region of ADm4745 cDNA, the A- and B-specific 5' genomic 

20 regions not represented in the cDNAs, and the 3 '-flanking 

region. The remaining exon boundaries of ADm4745 and 
genomic regions represented within the cDNA clones were 
sequenced on one strand. 

25 D. The E75 Gene Encodes Two Members of the Steroid 
Receptor Superfamily 

The coding and noncoding sequences of the E75 A and 
B mRNAs, their splice junctions, and the 5' and 3' 

3 0 flanking sequences are shown in Table 1. Certain 

sequences of potential interest within the 5 ' flanking 
DNA and in the 5 1 leader mRNA sequences are indicated in 
the legend to Table 1. We focus here on the large open 
reading frames of the E75 A and B mRNAs that begin at 

35 380 bp and 284 bp downstream from their respective mRNA 

start sites, each continuing into the common final exon. 
The termination codon in exon 5 lies upstream of both 
alternative polyadenylation sites; thus, the sequence of 
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the encoded protein is not affected by which site is 
selected. Since the open reading frames in the E75 A and 
B mRNAs begin in the AO and Bl exons and merge at the 
beginning of exon 2, the proteins encoded by the two 
5 transcription units differ in the amino-terminal region 

and are the same in the carboxy-terminal region. The 
specific amino-terminal regions contain 266 and 423 amino 
acid residues in the E75 A and B proteins, respectively, 
while their common carboxy-terminal region consists of 

10 971 residues. The predicted molecular weights of the A 

and B proteins are thus 132,000 and 151,000. The open 
reading frames display characteristic D. melanoaaster 
codon usage, and their extents have been confirmed by in 
vitro translation of mRNAs transcribed in vitro from cDNA 

15 constructs and by expression of fusion proteins in 

E. coli. The predicted protein sequence for each protein 
is punctuated by homopolymeric tracts of amino acids, 
which are noted in Table 1 and its legend. 

Analysis of the sequences of E75 proteins and 

20 comparison to the sequences of known proteins have 

revealed similarity between the E75 proteins and members 
of the steroid receptor superfamily (Evans, R. M. , 1988. 
The steroid and thyroid hormone receptor superfamily. 
Science 240 : 889-895 ; Green, S., and P. Chambon, 1988. 

25 Nuclear receptors enhance our understanding of 

transcription regulation. Trends in Genetics 4:309-314). 
We have used the nomenclature of Krust el al. (Krust, A., 
S. Green, P. Argos, V. Kumar, P. Walter, J. Bornert, and 
P. Chambon, 1986. The chicken oestrogen receptor 

3 0 sequence: Homology with v-erbA and the human oestrogen 

and glucocorticoid receptors. EMBO J. 5:891-897) in 
dividing the proteins into six regions, A to F, in the 
amino- to carboxy-terminal direction. 

Similarity between E75A and other members of this 

35 superfamily is strongest in the C region, a cysteine- 

lysine-arginine-rich region that is necessary and 
sufficient for the binding of these receptors to DNA (for 



WO 91/13167 



PCI7US91/01189 



53 

review, see, Evans, R. M. , 1988. The steroid and thyroid 
hormone receptor super family. Science 240:889-895; 
Green, S., and P. Chambon, 1988. Nuclear receptors 
enhance our understanding of transcription regulation. 
5 Trends in Genetics 4:309-314). The C region consists of 

66-68 amino acids, of which 20 residues are invariant 
within this family. Among these are nine invariant 
cysteine residues, eight are believed to coordinate zinc 
in the formation of two zinc finger-like structures 

10 (Miller, J., A. D. McLachlan, and A. Klug, 1985. 

Representative zinc-binding domains in the protein 
transcription factor IIIA from Xenoous oocytes. EMBO J. 
4:1609-1614; Freedman, L. P., B. F. Luisi, Z. R. Korszun, 
R. Basavappa, P. B. Sigler, and K. R. Yamamoto, 1988. 

15 The function and structure of the metal coordination 

sites within the glucocorticoid receptor DNA binding 
domain. Nature 334:543-546; Severne, Y. , S. Wieland, 
W. Schaffner, and S. Rusconi, 1988. Metal binding finger 
structure of the glucocorticoid receptor defined by site- 

20 directed mutagenesis. EMBO J. 9:2503-2508) . Within the 

C region, E75A contains all of the highly conserved 
residues and is approximately as closely related to other 
members of the steroid receptor superfamily as they are 
to one another. The closest relative of E75 appears to 

25 be the human ear-1 gene, which has nearly 80% amino acid 

identity to E75 A in the DNA-binding domain. 

The other region conserved among members of the 
steroid receptor superfamily is the E region, which is 
required for steroid binding and for the linkage of 

30 steroid-binding and trans -activation functions (for 

review, see . Evans, R. M. , 1988. The steroid and thyroid 
hormone receptor superfamily. Science 240:889-895; 
Green, S., and P. Chambon, 1988. Nuclear receptors 
enhance our understanding of transcription regulation. 

35 Trends in Genetics 4:309-314). Although overall E-region 

similarity is clearly significant for the comparison of 
E75 A to the thyroid hormone, vitamin D, and retinoic 
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acid receptors, and ear-1, similarity to the 
glucocorticoid and estrogen receptors is considerably 
lower. However, the plots of local similarities show a 
clear similarity to each of these proteins within three 
5 subregions of the E region, denoted El, E2 and E3. The 

El subregion is the most highly conserved and corresponds 
to a region shown by in vitro mutagenesis to be essential 
for steroid binding and steroid-dependent trans - 
activation (Giguere, V., S. M. Hollenberg, 

10 M. G. Rosenfield, and R. M. Evans, 1986. Functional 

domains of the human glucocorticoid receptor. Cell 
46:645-652; Danielson, M. , J. P. Northrop, J. Jonklaas, 
and G. M. Ringold, 1987. Domains of the glucocorticoid 
receptor involved in specific and nonspecific 

15 deoxyribonucleic acid binding, hormone activation and 

transcriptional enhancement. Mol. Endocrinol. 1:816- 
822) . Region E2 is less highly conserved in primary 
amino acid sequence, but can, in part, be seen as a 
conserved hydrophobic region in the hydropathy plots of 

2 0 several of these proteins. A deletion of 14 amino acids 

within this region abolished steroid binding 
(Rusconi, S., and K. R. Yamamoto, 1987. Functional 
dissection of the hormone and DNA binding activities of 
the glucocorticoid receptor. EMBO J. 6:1309-1315). E3 
25 falls close to the end of the region that is absolutely 

required for steroid binding. 

While the characteristic structural features of the 
steroid receptor superfamily are well conserved in E75, 
two novel variations are seen. The first of these 

3 0 concerns the structure of the E75 B protein, which 

contains a major alteration within its putative DNA- 
binding domain. The steroid receptor superfamily DNA- 
binding domain consists of two DNA-binding zinc fingers 
separated by a less conserved linker region. In E75, as 
3 5 in nearly all other genes of this family, an intron is 

found between the two fingers. In E75, this splice marks 
the beginning of the region held in common between the 
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E75 A and B proteins. This results in the E75 A protein 
having two fingers, while the E75 B protein has unrelated 
B-specific sequences in place of the first finger. Other 
sequences within the B-specific amino-proximal region may 
5 contribute to the DNA-binding domain of the E74B protein. 

Alternatively, the B protein might bind DNA with 
only one finger, as GAL4 transcription factor of yeast 
appears to do. It is possible that these structural 
differences imply a functional difference in the DNA- 

10 binding properties of the E75 A and B proteins that might 

allow them to differentially regulate the transcription 
of the late genes that characterize the secondary 
response to ecdysone in different target tissues. 

In this respect, it should be emphasized that the 

15 putative hormone- or ligand-binding domain is represented 

by the E region that is common to the E75A and E75B 
proteins. Thus, these proteins appear to be receptors 
for the same hormone, which may regulate the 
transcription of different sets of genes. These proteins 

20 represent "orphan" receptors, in that their hormone, or 

binding ligand, has not yet been identified. Because 
ecdysteroids are the only known steroid hormones in 
Drosophila , the most obvious candidate for an E75 ligand 
would be ecdysone itself. However, it is unlikely that 

25 this is the case, since the putative hormone-binding 

domain of the E75 proteins does not exhibit the high 
sequence homology to that of the known Drosophila 
ecdysone receptor encoded by the EcR gene (see 
Experimental Example III and Table 2) that would be 

3 0 expected if the E75 proteins were also ecdysone 

receptors. It, therefore, seems likely that the E75 
proteins would bind either a terpenoid juvenile hormone 
or a novel Drosophila hormone. 

The second unusual feature of the E75 proteins is 

35 the presence of a large F region, encompassing nearly one 

half of the proteins. Many of the other receptors have 
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ascribed to this region. 

Methods 

Protein sequence analysis 

Sequence data were compiled using the Bionet system. 
Protein sequence comparison was performed using FAS TP 
(Lipman, D. J., and W. R. Pearson, 1985. Rapid and 
sensitive protein similarity searches. Science 227:1435- 
1441) and Bionet IFIND programs. 

E. Expression Vectors for E75 Proteins 
In order to express the E75 proteins, portions of 
cDNAs and genomic clones were fused in order to generate 
cassettes containing the entire E75 A and E75 B protein 
coding regions. First, BamHI sites were introduced into 
genomic clones upstream of the initial AUGs of the large 
open reading frames. Then, E75 AO exon sequences were 
fused to sequences of a nearly full-length E75 A cDNA, 
and E75 Bl exon sequences were fused to sequences of a 
nearly full-length E75 B cDNA. These cassettes were 
cloned into pGEM3 (Promega) , and transcripts of the open 
reading frames were prepared using T7 polymerase. These 
were then translated in the presence of 35 S-methionine, 
and shown to give rise to proteins of appropriate size. 

These cassettes have been placed into a variety of 
expression vectors, including pUCHsneo/Act for expression 
in Drosophila cells, pSV2 for expression in mammalian 
cells, and pOTS for expression in bacterial cells. 



BamHI sites were introduced directly upstream of the 
initial ATGs of the E75A and E75B coding sequence — at 
the Sspl site upstream of the E75A initial ATG, and at 
the SacII site upstream of the E75B initial ATG. cDNA 
and genomic sequences were joined at the EcoRV site in 
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the AO exon to construct an E75A cassette, and at the 
Mlul in exon 3 to construct an E75B cassette. 



EXAMPLE II 

5 

CLONING, STRUCTURE AND EXPRESSION OF THE EcR AND 
DHR3 GENES THAT ENCODE ADDITIONAL MEMBERS OF THE 
STEROID RECEPTOR SUPERFAMILY. 

10 The following experiments were carried out after the 

primary structure of the E75 gene, and of the two members 
of the steroid receptor superfamily that it encodes, was 
determined (Experimental, Example I). The purpose of 
these experiments was to clone and determine the primary 

15 structure of other steroid receptor superfamily genes 

from Drosophila . and of the proteins they encode. The 
aim was to identify the gene that encodes a Drosophila 
ecdysone receptor, given that the characteristics of the 
E75 gene indicated that it did not encode an ecdysone 

20 receptor. The first stage of the experimental plan was 

to use the conserved sequences in the E75A transcription 
unit that encode the putative DNA-binding domain of the 
E75A receptor protein as a probe to screen a Drosophila 
genomic library to identify sequences encoding the 

25 putative DNA-binding domains of other Drosophila members 

of the steroid receptor superfamily. The second stage 
was to isolate cDNA clones corresponding to the 
identified genes, as well as additional genomic DNA 
clones, to obtain the nucleotide sequence of the complete 

30 coding region (i.e., the open reading frame encoding the 

respective receptors) and the exon-intron organization of 
these genes. 

The experiments described below resulted in the 
cloning and structural characterization of two genes that 
35 satisfy the criteria for bona fide members of the steroid 

receptor superfamily: encoding proteins that exhibit 
amino acid sequence homology to both the DNA-binding and 
the hormone-binding domains that are conserved among 
members of this superfamily. The two genes are called 
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EcR and DHR3 . The EcR gene was originally called DHR23, 
but was renamed EcR after it was shown to encode an 
ecdysone receptor (see Experimental Example III) . The 
DHR3 designation stands for Drosophila Hormone 
Receptor 3 . 

A. Identification and Chromosomal Mapping of EcR 
and DHR3 Genomic Clones 

Initially, Southern blots of total Drosophila 
genomic DNA, digested with one or more restriction 
endonucleases , were probed with a 530 bp fragment of the 
E75A cDNA containing the sequences encoding the putative 
DNA-binding domain of the E75A receptor protein (see 
Experimental Example I) at low and high stringency 
hybridization conditions. 

To isolate the sequences responsible for these low 
stringency bands, this E75A probe was used to screen a 
Drosophila genomic library under the same low stringency 
conditions, counterscreening duplicate filters with E75 
intron probes to eliminate phage containing inserts from 
the E75 gene- Five genome equivalents were screened and 
39 non-E75 containing phage were isolated. The 25 most 
strongly hybridizing clones were divided into six classes 
on the basis of restriction patterns and cross 
hybridization, each class containing between one and six 
independent overlapping genomic inserts. 

For each class, a restriction fragment containing 
the region of hybridization to the E75A probe was 
localized by Southern blotting. Hybridization of probes 
derived from these fragments to genomic Southern blots 
showed that each of the low stringency bands detectable 
by the E75A probe could be accounted for by one of the 
six isolated fragments. 

The nucleotide sequences of the six restriction 
fragments were determined to test whether they represent 
candidate receptor genes. In all cases, DNA sequence 
similarities with the E75A probe were observed that are 
sufficient to account for the hybridization of these 
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fragments with the probe. When the DNA sequences were 
conceptually translated in all six reading frames, four 
of the fragments yielded no significant sequence 
similarity with E75A at the protein level. The remaining 
5 two clones, however, showed predicted amino acid 

sequences with strong similarity to the DNA binding 
domains of the E75A protein and other steroid superfamily 
receptors . 

These two clones represent the EcR and DHR3 genes, 
10 as will become apparent. Probes from these clones were 

used to map the position of these genes in the polytene 
chromosomes by in situ hybridization. The EcR and DHR3 
chromosomal loci were mapped to positions 42A and 46F, 
respectively, in the right arm of the second chromosome. 

15 

B. Structure of the EcR and DHR3 Genes and Their 
CDNAS 

The DHR3 and EcR genomic clones described above were 

20 used to screen a cDNA library prepared from third instar 

tissues treated with ecdysone and cycloheximide. This 
procedure allowed the isolation of a large number of cDNA 
clones, since both genes have a peak period of 
transcription in late third instar after the rise in 

25 ecdysone titer. For each gene, 20 cloned cDNAs were 

purified and their lengths determined. Restriction maps 
for the 10 longest cDNAs from each gene were determined 
and found to be co linear. 

For EcR, a 5534 bp cDNA sequence was obtained from 

3 0 two overlapping cDNA clones. It contains an 878 codon 

open reading frame (ORF) which yields a predicted amino 
acid sequence expected for a member of the steroid 
receptor superfamily (Table 2) , as described in more 
detail below. The length of the largest DHR3 cDNA that 

35 was isolated (clone DHR3-9) is 4.2 kb. The nucleotide 

sequence of this cDNA was determined and found to contain 
a 487 codon AUG- initiated open reading frame (Table 3). 
As described below, the amino acid sequence of the DHR3 
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protein predicted from this sequence demonstrates that 
this protein is also a bona fide member of the steroid 
receptor superfamily. 
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Table 2. The cDNA sequence of the EcR gene. 
Numerals at the left refer to the nucleotide sequences; 
those on the right to the amino acid sequence in the EcR 
protein. Nucleotides 1-5194 are the sequence of EcR-17 
cDNA, while nucleotides 5195-5534 derive from the EcR-9 
cDNA. The underlined sequences in the 5' and 3' 
untranslated regions refer, respectively, to the ATG 
codons and the AATAAA consensus polyadenylation signals. 
Positions of the introns and the donor and acceptor 
splice sequences are indicated above the cDNA sequence in 
small type. The amino acid sequences homologous to the 
conceived DNA-binding (C region) and hormone-binding (E 
region) domains of the steroid receptor superfamily are 
underlined. 

Panel 1 is shown in detail in panels 2 and 3. 
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Table 3 . The cDNA sequence of the DHR3 gene . 
The numbering and underlining of the nucleotide and amino 
acid sequences have the same meaning as in Table 2, and 
the intron positions and donor and acceptor splice 
5 sequences are similarly indicated. The sequence of the 

5' proximal 2338 nucleotides of the DHR3-9 cDNA is shown. 
The sequence of the remainder of this 4.2 kb cDNA was 
determined for only one strand and is not shown. Four 
silent, third-position differences between the cDNA and 
10 genomic DNA sequences are indicated above the cDNA 

sequence . 
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The genomic structure of the EcR and DHR3 genes 
was investigated by isolating additional genomic DNA 
clones that form overlapping sets that contain all of the 
sequences found in the respective cDNA clones. The exons 
contained in these cDNAs were mapped within the genomic 
DNA by comparison of cDNA and genomic clones via Southern 
blot analysis, mapping of restriction cleavage sites, and 
finally, by determination of the nucleotide sequence of 
the genomic DNA in regions that contain the exon/intron 
boundaries. Table 2 and 3 show these boundaries and the 
sequence of the splice junctions for the EcR and DHR3 
genes, respectively. All of these splice junctions 
conform to the splice donor and acceptor consensus 
sequences. 

For EcR . the cDNA sequence shown in Table 2 is split 
into six exons spread over 36 kb of genomic DNA, with the 
ORF beginning in the second exon and ending in the sixth. 
For DHR3 , the cDNA sequence derives from nine exons 
spread over 18 kb, with the ORF beginning in the first 
exon and ending in the ninth . Because the 5 ' and 3 ' ends 
of the respective mRNAs were not mapped, it should be 
emphasized that these genes may have additional noncoding 
exons at their 5' or 3' ends. 

The EcR and DHR3 gene structures differ 
significantly from those of all previously examined 
steroid receptor superfamily genes. Comparison with the 
genes for 11 other receptor homologues for which at least 
partial structural information is available reveals that 
the positions of certain exon boundaries have been 
conserved in evolution. This conservation is most 
striking in the portion of the genes encoding DNA-binding 
domains. In the nine other cases where the structure of 
this region has been examined, the two halves of the DNA- 
binding domain are always encoded by separate exons. If 
we exclude the Drosophila genes knirps , knirps-related , 
and eaon (which are not bona fide receptor homologues 
since they lack the hormone-binding domain sequence 



WO 91/13167 



PCT/US91/01189 



64 

similarity) , these are always small exons, the second one 
invariably ending in the fourth codon beyond the 
conserved Met codon at the end of the C region. Thus, 
these exons each encode one of the two predicted zinc 
5 fingers of the DNA-binding domain. In contrast, both 

zinc fingers of the putative DNA-binding domain of the 
EcR and DHR3 receptors are encoded by a single exon. It 
is possible that our screen specifically selected for 
genes lacking the above intron. The screen selected 

10 genomic clones that hybridize to an E75A cDNA probe that, 

of course, lacks this intron. Genomic sequences 
containing a contiguous sequence encoding the DNA-binding 
domain would be expected to hybridize to this probe 
better than clones from genes containing the intron. 

15 This would explain the successful isolation of the EcR 

and DHR3 genes, and the failure to isolate the genes of 
other Drosophila members of the steroid receptor 
superf amily . 

20 Methods 

Isolation of cDNA and ad ditional genomic clones 

Subclones of the originally isolated DHR3 and EcR 
genomic clones were used to screen a cDNA library 
prepared from third instar tissues treated with ecdysone 

25 and cycloheximide. This library was chosen because both 

genes are relatively highly expressed at the end of third 
instar, and because of the high quality of the library. 
Of the 270,000 primary plaques screened, 20 positives for 
DHR3 and 220 for EcR were detected. Twenty cDNAs for 

3 0 each gene were purified, of which the ten largest for 

each were restriction mapped and found to be colinear. 
cDNA DHR3-9, which extends further in both the 5' and 3' 
directions than our other DHR3 cDNAs, was chosen for 
sequencing. For EcR, the longest cDNA, EcR-17, extended 

35 the farthest 5' and was sequenced in its entirety. An 

additional cDNA clone, EcR-9, was found to extend 300 bp 
farther 3 1 than EcR-17 , and this 3 1 extension was also 



WO 91/13167 



PCT/US91/01189 



65 

sequenced. Additional genomic DNA clones covering the 
EcR and DHR3 genes were obtained by screening the 
Drosophila Canton S genomic library referred to in part A 
above, either with probes from the respective cDNA 
5 clones, or, for overlapping clones, by the chromosomal 

walk method described in Experimental Example I. 

DNA sequence analysis 

cDNAs were subcloned into BlueScript vectors 
10 (Stratagene) , and clones for sequencing were generated by 

exonuclease III digestion (Henikoff, S., 1984. 
Unidirectional digestion with exonuclease III creates 
targeted breakpoints for DNA sequencing. Gene 
28:351-359) . 

15 Double-stranded plasmids were denatured (Gatermann, 

K. B., G. H. Rosenberg, and N. F. Kaufer, 1988. Double- 
stranded sequencing, using mini -prep plasmids, in 11 
hours. BioTechniques 6:951-952) and sequenced by the 
dideoxy chain terminating method (Sanger, F. , S. Nicklen, 

20 and A. R. Coulson, 1977. DNA sequencing with chain- 

terminating inhibitors. Proc. Natl. Acad. Sci. USA 
74:5463-5467), using the enzyme Sequenase (U.S. 
Biochemical) . cDNA EcR-17 was completely sequenced on 
both strands, as was the EcR-9 3' extension. cDNA DHR3- 

25 9 was sequenced on both strands for the 5' most 2338 bp, 

which contains the entire ORF, and the remainder of the 
long 3' untranslated region was sequenced on one strand. 

The exon/intron boundaries in genomic DNA clones 
were first mapped at low resolution by Southern blot 

3 0 analysis of their restriction fragments probed with 

labeled cDNAs. Genomic DNA surrounding each exon/intron 
boundary was subcloned and the nucleotide sequence of 
these subclones determined as above. 

Genomic exons were either sequenced entirely, or for 

35 the longer exons, were digested and electrophoresed in 

parallel with cDNA clones to confirm the colinearity of 
the genomic and cDNA clones. Shorter exons were 
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completely sequenced from genomic clones. Longer exons 
had their boundaries sequenced from genomic clones, and 
were confirmed to be colinear with the cDNA clones by 
parallel digestion and electrophoresis of the cDNA and 
5 genomic clones. 

C. The Predicted Amino Acid Sequence of the EcR 
and DHR3 Proteins and their Implications 

10 Comparison of the predicted EcR and DHR3 protein 

sequences to the sequence database and to individual 
members of the steroid receptor superfamily shows that 
these proteins share the two conserved domains 
characteristic of this superfamily (Evans, R. M. , 1988. 

15 The steroid and thyroid hormone receptor superfamily. 

Science 240:889-895; Green, S., and P. Chambon, 1988. 
Nuclear receptors enhance our understanding of 
transcription regulation. Trends in Genetics 4:309-314). 
We refer to the domains as the C and E regions, for the 

20 more amino-terminal and more carboxy-terminal homologies, 

respectively, according to the nomenclature of Krust et 
al. (Krust, A., S. Green, P. Argos, V. Kumar, P. Walter, 
J. M. Bornert, and P. Chambon, 1986. The chicken 
oestrogen receptor sequence; homology with v-erbA and the 

25 human oestrogen and glucocorticoid receptors. EMBO J. 

5:891-897). These domains are underlined in Tables 2 and 
3, and Table 4A-C presents a comparison of these domains 
from EcR and DHR3 with those from representative members 
of the superfamily. 
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Table 4. Sequence comparison of the conserved 
C and E regions in DHR3, EcR, and some representative 
nuclear receptor homologues. (A) C-region alignment. 
5 Numbers at the left indicate the amino acid positions 

within the individual receptors; dashes indicate gaps 
introduced to obtain maximal alignment. Dots indicate 
three positions important in determining the DNA binding 
specificity of this domain. (B) E-region alignment. 

10 Bars indicate the three most highly conserved stretches 

within this domain. (C) Computed percent identities 
among the C-region sequences (lower left) and among the 
E-region sequences (upper right) . The kni sequence shows 
no significant E-region homology and is, therefore, not 

15 included in this comparison. Sequences shown are from: 

E75A, Drosophila ecdysone-inducible gene at 75B; kni, 
Drosophila segmentation gene knirps ; hRARa, human 
retinoic acid receptor alpha; htRB, human thyroid 
receptor beta; hVDR, human vitamin D receptor; cOUP-TF, 

20 chicken ovalbumin upstream promoter transcription factor; 

hERRl and hERR2, human estrogen-related receptors 1 and 
2; hER, human estrogen receptor; hGR, human 
glucocorticoid receptor; hMR, human mineralocorticoid 
receptor; hGR, human progesterone receptor. 

25 Panel 1 is shown in detail in panels 2-6. 
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The C region is a 66-68 amino acid domain that 
has been shown to function as a zinc finger DNA binding 
domain in vertebrate receptors. This domain has also 
been implicated in receptor dimerization (Kumar, V. , and 
5 P. Chambon, 1988. The estrogen receptor binds tightly to 

its responsive element as a ligand- induced homodimer. 
Cell 55:145-156). As shown in Table 4A, all 19 C-region 
residues that are absolutely conserved in the other 
receptor homologues are also conserved in DHR3 and EcR, 

10 including the nine invariant Cys residues, eight of which 

coordinate two zinc ions (Freedman, L. P. , B. F. Luisi, 
Z. R. Korszun, R. Basavappa, P. B. Sigler, and 
K. R. Yamamoto, 1988. The function and structure of the 
metal coordination sites within the glucocorticoid 

15 receptor DNA binding domain. Nature 334:543-546). As 

seen in Table 4C, the Drosophila C-region sequences 
(including those of E75A) are not more closely related to 
each other than they are to those from the vertebrate 
receptor homologues. The C region of DHR3 is most 

20 similar to that of the human retinoic acid receptor a 

(hRARa) , and the C region of EcR is most similar to that 
of the human thyroid receptor B (hTRB) . Studies on the 
human glucocorticoid receptor (hGR) and human estrogen 
receptor (hER) have identified three C-region residues 

25 (indicated by dots in Table 4 A) that are critical for 

determining the differential DNA binding specificity of 
these receptors (Mader, S., V. Kumar, H. de Verneuil, and 
P. Chambon, 1989. Three amino acids of the estrogen 
receptor are essential to its ability to distinguish an 

30 estrogen from a glucocorticoid-responsive element. 

Nature 338:271-274; Umesono, K. , and R. M. Evans, 1989. 
Determinants of target gene specificity for 
steroid/thyroid hormone receptors. Cell 57:1139-46). 
The three Drosophila proteins DHR3 , EcR, and E75A, as 

3 5 well as the vertebrate receptors hRARa , hTRB, and the 

human vitamin D receptor (hVDR) , all have identical amino 
acids at these three positions; thus, these proteins may 
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all have similar DNA binding specificities, as has 
already been shown for hRARa and hTRB (Umesono, K. , 
V. Giguere, C. K. Glass, M. G. Rosenfeld, and 
R. M. Evans, 1988. Retinoic acid and thyroid hormone 
5 induce gene expression through a common responsive 

element. Nature 336:262-265) . 

The E-region is an -225 amino acid domain that 
functions as a hormone-binding domain in vertebrate 
receptors. This domain has also been implicated in 

10 hormone dependent receptor dimerization (Kumar, V. and 

P. Chambon, 1988. The estrogen receptor binds tightly to 
its responsive element as a ligand- induced homodimer. 
Cell 55:145-156; Guiochon, M. A., H. Loosfelt, P. Lescop, 
S. Sar, M. Atger, A. M. Perrot, and E. Milgrom, 1989. 

15 Mechanisms of nuclear localization of the progesterone 

receptor: evidence for interaction between monomers. 
Cell 57:1147-1154), hormone dependent nuclear 
localization of the glucocorticoid receptor (Picard, D. , 
and K. R. Yamamoto, 1987. Two signals mediate hormone- 

20 dependent nuclear localization of the glucocorticoid 

receptor. EMBO J. 6:3333-3340), and binding of the 
glucocorticoid receptor to the 90 kDa heat shock protein 
(Pratt, W. B., D. J. Jolly, D. V. Pratt, 
W. M. Hollenberg, V. Giguere, F. M. Cadepond, 

25 G. G. Schweizer, M. G. Catelli, R. M. Evans, and 

E. E. Baulieu, 1988. A region in the steroid binding 
domain determines formation of the non-DNA-binding , 9 S 
glucocorticoid receptor complex. J. Biol. Chem. 263:267- 
273) . Table 4B shows an alignment of the E regions of 

3 0 the DHR3 and EcR proteins with those of other receptor 

homologues. The three relatively highly conserved 
stretches within this region noted in Experimental 
Example I are overlined; each contains a cluster of 
residues conserved in all or most of the receptor 

3 5 sequences. DHR3 and EcR show strong similarity to each 

other and to the other proteins in these stretches, and a 
lower similarity outside of them. The presence of this 
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E-region homology establishes these proteins as bona fide 
members of the nuclear receptor family, in contrast to 
the Drosophila knirps (Nauber, U. , M. J. Pankratz, 
A. Kienlin, E. Seifert, U. Klemm, and H. Jackie, 1988. 
5 Abdominal segmentation of the Drosophila embryo requires 

a hormone receptor-like protein encoded by the gap gene 
knirps. Nature 336:489-492), knirps-related (Oro, A. E. , 
E. S. Ong, J. S. Margolis, J. W. Posakony, M. McKeown, 
and R. M. Evans, 1988. The Drosophila gene knirps- 

!0 related is a member of the steroid-receptor gene 

superfamily. Nature 336:493-496), and ecron (Rothe, M. , 
U. Nauber, and H. Jackie, 1989. Three hormone receptor- 
like Drosophila genes encode an identical DNA-binding 
finger. EMBO J. 8:3087-3094) proteins, which show C- 

15 region homology but no E-region homology. The E region 

in DHR3 is most similar to that of E75A, and the E region 
of EcR is most similar to that of hTRfi, although the 
level of these similarities is lower than those found 
among E regions of many other receptors (Table 4C) . 

20 Thus, DHR3 and EcR are not especially close homologues of 

any previously cloned receptors. Comparison of E-region 
sequences allows division of the nuclear receptors into 
subfamilies (Petkovich, M. , N. J. Brand, A. Krust, and 
P. Chambon, 1987. A human retinoic acid receptor which 

25 belongs to the family of nuclear receptors. Nature 

330:444-450), the members of any one subfamily being more 
related to each other than to those in other subfamilies. 
The DHR3 and EcR receptors fall into a subfamily with the 
E75A, E75B, hRARa, hTRB, and hVDR receptors. 

30 

D. In Situ Labeling of the EcR and DHR3 Proteins 

with Antibodies Induced by Proteins Produced in 
E. coli 

35 To determine the intracellular and tissue 

distribution of the EcR and DHR3 proteins in Drosophila . 
affinity-purified polyclonal antibodies directed against 
those proteins were produced in the following manner. 
The region of about 120 amino acid residues located 
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between the conserved DNA-binding and hormone-binding 
domains of these proteins was used as the immunogen to 
produce antibodies against each protein. Thus, the 
coding sequences for amino acids 335-447 of the EcR 
5 protein and for amino acids 164-289 of the DHR3 protein 

(see Tables 2 and 3, respectively) were cloned into the 
appropriate pATH (Dieckmann, C, and A. Tzagaloff, 1985. 
J. Biol. Chem. 260:1513-1520) or pUR expression vectors, 
so as to fuse these coding sequences to those encoding E. 

10 coli B-galactosidase (Bgal) or to E. coli tryptophan E 

protein (trpE) , respectively. 

The Bgal fusion proteins were produced in E. coli by 
the addition of the IPTG inducer to exponential cultures, 
while the production of trpE fusion proteins were induced 

15 by dilution into tryptophan-free media and subsequent 

addition of indoleacetic acid. For EcR, the trpE fusion 
protein was used as an immunogen and the Bgal fusion 
protein was used on immunoblots to test sera for 
immunoreactivity to the EcR portion of the fusions. For 

20 DHR3, the Bgal fusion protein was injected, and sera were 

checked against the trpE fusion protein. 

For immunization the appropriate fusion protein was 
prepared by electrophoresis on SDS-PAGE gels and 
visualized by staining in ice-cold 0.25 M KC1, after 

25 which the fusion protein band was cut out. Approximately 

100 ng of fusion protein in 0.25 ml of gel slice was 
crushed by passing through successively smaller 
hypodermic needles, and mixed with 0.25 ml of a sterile 
saline solution and 0.5 ml of Freund's complete adjuvant. 

30 For each immunogen, two New Zealand White rabbits were 

injected at multiple intramuscular sites, and after one 
month, boosted at two-week intervals, omitting the 
Freund's adjuvant. While the Bgal fusion proteins were 
subject to the above gel electrophoresis without prior 

35 purification, the trpE fusion proteins were first 

purified by the following method which takes advantage of 
their insolubility in vivo . 
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E. coli from a two-liter culture of induced cells 
were washed, and the cell pellet was subjected to several 
freeze/thaw cycles. The cells were resuspended in 18 ml 
of 50 mM Tris-HCl, pH 7.5, 0.5 mM EDTA, and 1.8 ml of 10 
5 mg/ml lysozyme was added. After 15 minutes on ice, the 

cells were lysed by passing three times through a french 
pressure cell at 10,000 psi. The insoluble fraction was 
collected by centrifugation at 27,000 x g for 15 minutes, 
and washed by resuspension, using a Dounce homogenizer, 

10 in ice-cold 50 mM Tris-HCl, 0.5 mM EDTA, 0.3 M NaCl, 

followed by centrifugation as above. The washing step 
was repeated, and the final pellet dissolved in 10 ml of 
4M urea, 2% (w/v) SDS, 50 mM Tris-HCl, pH 7.5, 1 mM EDTA, 
5% (v/v) 2-mercaptoethanol. Material remaining insoluble 

15 was centrifuged out and discarded. 

The antisera was affinity purified in a two-step 
procedure by successive passage through "nonspecific" and 
"specific" affinity columns. In the case of antibodies 
raised against the trpE fusion proteins, the nonspecific 

20 column consisted of resin coupled to the insoluble 

protein derived from E. coli expressing unmodified trpE 
protein, and was used to remove antibodies directed 
against trpE epitopes, as well as against insoluble E. 
coli protein impurities. The specific column consisted 

25 °f resin coupled to the EcR-trpE fusion protein (purified 

as described above) and was used to absorb the desired 
antibodies directed against the EcR epitopes, antibodies 
that were subsequently released from the column. In the 
case of antibodies raised against the Bgal fusion 

30 proteins, the same general procedure was used, except 

that the resin in the nonspecific column was coupled to 
6-galactosidase, while that in the specific column was 
coupled to the DHR3-6gal fusion protein. Western blot 
analysis of the appropriate £. coli extracts demonstrated 

35 that these affinity-purified antibodies exhibited the 

desired specificity. 
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The intracellular distribution of the EcR protein in 
late third instar salivary glands was examined by in situ 
labeling of this protein with the anti-EcR antibody. The 
EcR protein was thereby shown to be highly localized in 
5 the nuclei of these glands. Indeed, when the polytene 

chromosomes in these nuclei were examined by the 
antibody-labeling method of Zink and Paro (Zinc, B. , and 
R. Paro, 1989. Nature 337:468-471), specific loci within 
these chromosomes exhibited strong binding of the EcR 

10 protein. In particular, the EcR protein was bound to the 

early puff loci, including those occupied by the E75 and 
E74 genes. This is the result expected if the ecdysone 
receptor encoded by the EcR gene is that which induces 
the transcription of the early genes, as anticipated by 

15 the Ashburner model. Another prediction of the Ashburner 

model is that the ecdysone-receptor complex initially 
represses the genes responsible for the later puff, so 
that the transcription of the late genes induced by the 
early gene proteins is delayed until these proteins 

20 accumulate sufficiently to overcome this initial 

repression. If the EcR receptor is involved in this 
postulated initial repression, then one would expect the 
EcR protein to bind to the late puff loci in the salivary 
glands. This expectation was met by the observation that 

25 EcR protein also binds to the late puff loci in the 

polytene chromosomes. 

Additional in situ antibody labeling experiments 
demonstrated that the EcR protein is present in the 
nuclei of all ecdysone target tissues examined in late 

3 0 third instar larvae. It is also present in most, if not 

all, cells during embryogenesis and other stages of 
Drosoohila development that have been examined. In this 
respect, the EcR protein was not detected by anti-EcR 
antibody labeling of embryos in which the EcR gene was 

35 eliminated by a chromosomal deletion, further 

demonstrating the specificity of this antibody. 
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In contrast to the widespread distribution of the 
EcR protein, anti-DHR3 antibody labeling of embryos 
demonstrated that the distribution of the DHR3 protein is 
highly restricted during this stage of development. 
5 During the brief embryonic period of expression, the 

protein is restricted to the peripheral nervous system, 
and to cells surrounding the spiracles at the posterior 
end of the embryo. 

Finally, it should be noted that affinity-purified 

10 antibodies against the E75A protein have also been 

prepared by the same technique described above for anti- 
EcR and anti-DHR3 antibodies. In situ antibody labeling 
of the E75A protein in larval salivary glands has also 
demonstrated that this protein is localized in the 

15 nucleus and is bound to specific loci in the polytene 

chromosomes . 



EXAMPLE III 

THE ECDYSTEROID-BINDING , DNA-BINDING AND GENETIC 
20 REGULATORY PROPERTIES OF THE EcR PROTEIN DEMONSTRATE 

THAT IT IS AN ECDYSONE RECEPTOR. 

The following experiments demonstrate that the 
protein encoded by the EcR gene is an ecdysone receptor 

25 by the following three criteria. (1) The EcR protein 

binds ecdysteroids and accounts for a large proportion, 
if not all, of the ecdysteroid-binding activity present 
in Drosophila embryos and in a variety of cultured 
Drosophila cells. (2) The EcR protein binds with high 

30 specificity to a DNA sequence that functions as an 

ecdysone response element (EcRE) , i.e., an enhancer that 
confers ecdysone inducibility to a promoter. (3) Cells 
that do not respond to ecdysone because they lack 
functional ecdysone receptors are transformed to the 

35 ecdysone-responsive state by transfection with an EcR 

expression plasmid. 



A. The EcR Protein Binds Ecdvsteroiris 
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The EcR expression plasmid, pMTEcR, shown in 
Figure 1 contains the open reading frame encoding the EcR 
protein ( EcR ORF; see Experimental Example II) fused to 
the Drosophila metal lothionine promoter (Pmt) at i ts 5 ' 
5 end, and the polyadenylation-cleavage sequences of the 

Drosophila Actin 5C gene at its 3 1 end. Because 
transcription of the EcR ORF is under the control of this 
metallothionine, that transcription is induced by Cu 2+ ion 
to yield an mRNA that, in turn, produces the EcR protein. 

10 A cell line, MtEcRHy, that overproduces this protein upon 

Cu 2+ induction, as determined by Western blot analysis 
using the affinity-purified anti-EcR antibody (see 
Experimental Example II) , was constructed by the stable 
integration of the pMTEcR plasmid DNA into the genome of 

15 Drosophila Sch-2 cell line. A control cell line, MtHy, 

was similarly constructed by the integration of the 
expression vector DNA lacking the EcR ORF. 

Whole cell extracts were prepared from both the 
MtEcRHy and MtHy cell lines after Cu 2+ induction, and were 

20 assayed for ecdysteroid-binding activity using the high 

affinity ecdysone analogue [ 125 I] iodoponasterone A. The 
MtEcRHy extract contained sevenfold more saturable 
ecdysteroid-binding activity than the MtHy control 
extract. 

25 To see if the induced ecdysteroid-binding activity 

was due to the EcR polypeptide itself, the EcR protein 
was depleted from the MtEcRHy extract by 
immunoprecipitation using an affinity-purified anti-EcR 
polyclonal antibody, or, as a control, the extract was 

30 mock-depleted with preimmune serum. The treated extracts 

were then assayed for ecdysteroid-binding activity. 
Comparison of the immuno-depleted extract with the mock- 
depleted extract showed that most of the binding activity 
was removed by the anti-EcR antibody treatment, 

35 indicating that the induced ecdysteroid-binding activity 

results from the EcR protein. 
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The endogenous ecdysteroid-binding activity in the 
control cell line, MtHy, was unchanged by Cu 2+ exposure, 
and was approximately the same as that in the Sch-2 cell 
from which it derives. The question arises as to whether 
the endogenous activity in these and other Drosophila 
cell lines, as well as in embryonic extracts, results 
from the expression of the EcR gene in their respective 
genomes. To answer this question, extracts from embryos 
and several cell lines were immuno-depleted and mock- 
depleted, as described above, and assayed for 
ecdysteroid-binding activity. Again, comparison of these 
treated extracts showed that the large majority of the 
endogenous binding activity was removed in each case by 
treatment with the anti-EcR antibody. Thus, it appears 
that most, if not all, of the endogenous binding activity 
in embryos and cell lines results form the resident EcR 
gene. 

Methods 
Extracts 

Tissue culture cell extracts for hormone and DNA- 
binding experiments were prepared as follows. Cells were 
grown in spinner flasks to a density of 5-7xl0 6 cells/ml, 
and were washed once in EcR buffer (25 mM Hepes, pH 7.0, 
40 mM KC1, 10% (v/v) glycerol, 1 mM EDTA, 1 mM 
dithiothreitol, and the following cocktail of protease 
inhibitors: 10 mM Na 2 S 2 0 5 , 500 /iM PMSF, 1 /iM leupeptin, 
1 nK. pepstatin) . All further manipulations were at 4°C. 
Cells were resuspended in EcR buffer at 2% of the 
original culture volume, divided into 3 ml aliquots, and 
sonicated using 30 1/2 second pulses with a probe 
sonicator (Bronson Sonifier 450) , resulting in 
disruptions of -95% of the cells. After centrifugation 
at 100,000 x g for 1 hour, 100 jtl aliquots of supernatant 
were frozen in liquid nitrogen, and stored at -80 °C. 
Protein concentration was determined using bovine serum 
albumin as the standard, and was typically 6-11 mg/ml. 
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Embryo extracts were prepared by a similar protocol: 
3-6 hour Canton S embryos were dechorionated in 55% 
commercial bleach for 2 minutes, washed extensively in 
0.7% NaCl, and resuspended using 2 grams of embryos per 
5 ml of EcR buffer. Embryos were broken with 20 strokes in 

a Dounce homogenizer using a B pestle, and lysis was 
completed with the probe sonicator using the same 
settings as used for the tissue culture cells. The 
extract was adjusted to 400 mM KC1, centrifuged 1 hour at 
10 100,000 x g, and aliguots of supernatant were frozen. 

This extract contained 13.4 mg/ml protein. Before use in 
hormone binding, it was diluted tenfold in EcR buffer 
lacking KC1 to bring the final KC1 concentration to 
40 mM. 

15 

Hormone-binding assays 

For hormone-binding experiments, extracts were first 
diluted to the following concentrations in EcR buffer: 
0.9 mg/ml for MtHy and MtEcRHy extracts, 3 mg/ml for S2 

20 and SRS 1.5 extracts, 4 mg/ml for the Kc cell extracts, 

and 1.3 mg/ml for the embryo extract. All manipulations 
were done on duplicate samples in order to guantify 
variability in the results. For immunoprecipitation 
experiments, extracts' were immuno-depleted, mock- 

25 depleted, or left untreated. For depletions, 300 fil of 

diluted extract was incubated for 30 minutes at 25 °C with 
3.5 nl affinity-purified anti-EcR antibody, or with 
3.5 nl preiramune serum for the mock-depletion control . 
Then 38 /xl 10% Staphylococcus aureus (Pansorbin, 

30 Calbiochem) in EcR buffer was added, and incubation was 

continued for 15 minutes at 25 *C. After centrifugation 
for 3 minutes in a microcentrifuge, the supernatant 
(depleted extract) was recovered. The 

immunoprecipitation was repeated, except in the case of 
3 5 the embryo extract, which was subjected to only one round 

of precipitation. The "untreated" extract aliguots were 
left at 4°C for the duration of the depletion procedure, 
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and were diluted with EcR buffer to match the final 
concentration of the depleted aliquots. 

A modification of the hormone-binding assay of P. 
Cherbas was used (Cherbas, P. 1988. Proc. Nat'l Acad. 
Sci. , U.S.A. 85:2096-2100). Assay tubes contained 140 jiil 
extract, 14 jul [ 125 I] iodoponasterone, and either 14 Ml 
EcR buffer or 14 y.1 unlabelled 20-OH ecdysone in EcR 
buffer as a competitor. [ 125 I] iodoponasterone was 2177 
Ci/mM and was used at a final concentration of 5xl0" 10 M in 
the assay; 20-OH ecdysone was 2xl0" s M final concentration 
in the assay. After incubation for 1 hour at 25 °C, each 
reaction was spotted on a dry Whatman GF/C filter 
(2.4 cm), and after 30 seconds the filter was washed by 
using a vacuum to draw 10 ml EcR buffer through the 
filter over a period of 1 minute. Filters were placed in 
800 Ml 4% SDS, and radioactivity was measured in a 7 
counter. The hormone-binding activities shown are 
saturable binding activities, calculated as the total 
binding activity, as measured in assays with no added 
competitor, minus the unsaturable binding activity, which 
was measured in the assays with excess unlabelled 
ecdysone added. In the most active extracts, the 
unsaturable activity (representing the large number of 
low affinity binding sites in the extract) was less than 
10% of the total activity. 

B. Genetic Regulatory Activity of the EcR Protein 
in vivo 

An ecdysone-inducible reporter plasmid, 
pEcRE/Adh/Bgal (Figure 2) , was constructed to test the 
regulatory functions of the EcR protein in vivo . The 
reporter gene in this plasmid consists of the sequence 
that encodes the E. coli B-galactosidase ( Bcral ORF) 
linked through the 5' leader sequence of the Drosophila 
Ultrabithorax gene ( UBX leader and AUG ) to an ecdysone- 
inducible promoter. This promoter was created by fusing 
a truncated version of the proximal promoter for the 
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Drosophila Adh gene (PDAdh-34+53f the numbers indicating that 
it consists of the sequence from base pair positions -34 
to +53, which just includes the TATA box) to seven 
repeats of a 34 bp synthetic oligonucleotide (7 EcRE 
5 OLIGOS) which contains the ecdysone response element 

(EcRE) from the ecdysone-inducible heat shock gene hsp 27 
(Riddihough and Pelham, 1987. EMBO_J i _ 6:3729-3734). The 
seven EcREs should confer ecdysone-inducibility to the 
truncated promoter, provided that the cells transfected 

10 with this reporter plasmid contain the appropriate 

ecdysone receptor. 

This ecdysone-inducible reporter plasmid was 
constructed by insertion of the 7 EcRE OLIGOS into 
plasmid pAdh/Bgal, which is identical to pEcRE/Adh/ Bgal 

15 except that it lacks the array of ecdysone response 

elements. The pAdh/Bgal plasmid should therefore not be 
ecdysone inducible and can serve as a control. To test 
these expectations, Sch-2 cultured cells (which were 
shown above to contain endogenous ecdysone-binding 

20 activity) were transfected with each plasmid and examined 

for B-galactosidase activity in the presence and absence 
of ecdysone. The ecdysone- induced B-galactosidase 
activity in the pEcRE/Adh/Bgal transfected cells was 
2000-fold greater than when such cells were not exposed 

25 to ecdysone, whereas ecdysone had little effect on the 

pAdh/Bgal transfected cells. These results indicate that 
the EcREs confer ecdysone-inducibility on the PoAdh-34+53 
promoter, as expected, and that the Sch-2 cells contain 
functional ecdysone receptors. 

30 To test the function of the EcR receptor in such a 

system, host cells lacking functional ecdysone receptors 
are required. "Ecdysone-resistant" cells lacking 
ecdysone-binding activity, and hence, presumably 
functional receptors, can be produced by continuously 

35 exposing ecdysone-responsive cells to ecdysone during a 

period of several weeks. This ecdysone-resistant state 
is then maintained in ecdysone-f ree media for several 
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months. An ecdysone-resistant cell line, SRS 1.5, was 
therefore generated by growing Sch-2 cells in 3xl0" 6 M 
ecdysone. The SRS 1.5 cells lack significant ecdysone- 
binding activity. 

When these cells were trans fected with the 
pEcRE/Adh/Bgal plasmid and subsequently exposed to 
ecdysone, very little ecdysone- induced 6-galactosidase 
activity was observed, indicating that the cells have 
only trace amounts, if any, of functional receptors. To 
test whether the expression of the EcR gene can "rescue" 
this deficiency, the SRS 1.5 cells were cotransfected 
with two plasmids: the ecdysone-inducible reporter 
plasmid, PEcRE/Adh/Bgal , and a constitutive expression 
plasmid for the EcR gene, pActEcR, in which transcription 
of the EcR ORF is .controlled by the Drosophila Actin 5C 
promoter, P^sc (Figure 3) . Cotrans feet ion with these two 
plasmids, followed by exposure to ecdysone, resulted in a 
dramatic induction of 6-galactosidase activity. Thus, 
introduction of this EcR expression plasmid into the 
SRS 1.5 cells regenerated the ecdysone-inducibility they 
had lost. 

Methods 

Construction of the pAdh/Baal, pEcRE/Adh/Baal and PActEcR 
plasmids 

Plasmid pAdh/Bgal was constructed in two steps. The 
Bglll-Scal fragment of pDA5'-34, containing nucleotides - 
34 to +53 of the Drosophila Adh distal promoter, was 
cloned into pUC18 cut with Seal and BamHI. The resulting 
plasmid was cut with EcoRl, and the EcoRl fragment of 
cPBbxd6.2 (containing the Ubx untranslated leader and 
AUG, the Bgal open reading frame, and the SV40 splice and 
poly A signals) inserted. 

To construct pEcRE/Adh/Bgal from pAdh/Bgal, two 34- 

residue oligonucleotides were synthesized: 

5 1 TCGAGAGACAAGGGTTCAATGCACTTGTCCAATG3 1 

3 • CTCTGTTCCCAAGTTACGTGAACAGGTTACAGCT5 • 
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These will anneal to form 30 bp duplexes with Sail 
compatible four nucleotide overhangs at their 5' ends, as 
shown. Further annealing via the 5' overhangs allows 
formation of tandem arrays that can be inserted into 
pAdh/Bgal at its Sail site just upstream from the TATA 
box of the truncated Adh promoter. When these 
oligonucleotides were kinased, annealed, ligated into 
Sail-cut pAdh/Bgal and cloned, pEcRE/Adh/Bgal was 
obtained. Restriction mapping showed that it contained a 
tandem array of seven 34 bp repeats, each of which 
contains the 23 bp ecdysone response element (EcRE) 
present in the hsp 27 gene, the remaining 11 bp 
representing flanking hsp 27 sequences and the 5' 
overhangs . 

The constitutive EcR expression plasmid, pActEcR, 
was formed by inserting the Fspl-Hpal fragment of an EcR 
cDNA containing bp 851-4123 that contains the ORF 
encoding the EcR protein (Table 2) , into the EcoRV site 
of the ActSV40BS plasmid. This expression vector was 
constructed in two steps by inserting the Xbal-EcoRl 
fragment of cosPneoB-gal , containing the SV40 splice and 
poly A signals, into BlueScript+KS (Stratagene) cut with 
SacII and Xbal, blunting the EcoRl and SacII ends. The 
resulting plasmid was digested with BamHl and Apal, and 
the BamHl-EcoRl fragment of pPAc was inserted, with the 
Apal and EcoRl ends being blunted. 

Transfection and generation of the cell line SRS 1.5 

The cell line SRS 1.5 was obtained by growing 
Schneider line 2 (Sch-2) cells in the presence of 3xl0" 6 M 
20-OH ecdysone (Sigma) . This treatment initially halts 
growth of Sch-2 cells, but after several weeks the 
adapted cells grow well. SRS 1.5 cells were washed in 
hormone-free medium and passed several times in hormone- 
free medium prior to their use in transfection 
experiments. Cells were transfected by the calcium 
phosphate technique. Cells were transfected with 10 p.q 
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of each plasmid used; when only a single plasmid was 
being transfected, 10 fig of pUC18 DNA was added as a 
carrier. In general, all transfections were carried out 
in duplicate. Twenty-four hours after transf ection, 
cells that were to undergo hormone treatment were split 
into two dishes, one of which was treated with 2xl0" 6 M 
20-OH ecdysone. 

fi-qalactosidase assays 

Forty-eight hours after transfection, 2 ml of cells 
were washed once in PBS (137 mM NaCl, 27 mM KC1, 65 mM 
Na 2 HP0 4 , 15 mM KH 2 P0 4 , pH 6.8), and were resuspended in 
50 /il of 0.25 M sucrose, 10 mM Tris, pH 7.4, 10 mM EDTA, 
and repeatedly frozen in liquid nitrogen and thawed in a 
37 'C water bath for a total of 3 freeze/thaw cycles. 
Cell debris was removed by a 10-minute centrifugation in 
a microcentrifuge at 4°C. The concentration of protein 
in the supernatant (cell extract) was determined by the 
Bradford method, with bovine serum albumin as a standard, 
and was typically 1.5-2.5 mg/ml. Extracts were assayed 
immediately or frozen and assayed up to two weeks later 
with no loss in activity. To 10 jul of extract, or an 
appropriate dilution, 500 fil of assay buffer was added 
(0.6 mM 4-methylumbelliferyl-B-D-galactoside, 60 mM 
Na 2 HP0 4 , 40 mM NaH 2 P0 4 , 10 mM KC1, 1.0 mM MgS0 4 , pH 7.0). 
After a 30-minute incubation at 37*C, reactions were 
stopped with 500 nl of 300 mM glycine, 15 mM EDTA, 
pH 11.2. The fluorescent reaction product was quantified 
on a Perkin-Elmer LS-5B luminescence spectrometer, with 
A ex =3 65 nm and A em =450 nm. Bgal activities are given as 
fluorescence units per fig protein assayed. 

C. Specific Binding of the EcR Protein to Ecdysone 
Response Elements 

The simplest explanation of the results described in 
the preceding section is that the EcR protein generated 
by the EcR expression plasmid binds to the EcRE of the 
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reporter plasmid and, in combination with ecdysone, 
activates the minimal Adh promoter in that plasmid. The 
following experiment was designed to test whether the EcR 
protein exhibits specific binding to this EcRE in vitro . 
5 Two plasmids were used: pUC18, which serves as the 

control, and pUC18-EcRE, which was generated by 
substituting the Hindll-Xbal fragment from pEcRE/Adh/figal 
that contains the seven repeats of the 34 bp EcRE 
oligonucleotide, for the Hindll-Xbal fragment of pUC18. 

10 Because the only difference between these two fragments 

is the seven oligonucleotide repeats, this is also the 
only difference between the two plasmids. 

The two plasmids were digested with ApaLI and Hind 
III, end-labeled with 32 P and mixed with an extract from 

15 MtEcRHy cells in which the EcR protein was overexpressed 

by Cu 2+ induction (see section A, above) . After a 
15-minute incubation at 25 # C to allow EcR-DNA binding to 
occur, affinity-purified anti-EcR antibody was added. 
The 25 'C incubation was continued for an additional 

20 40 minutes, at which time anti-rabbit Ig-coated magnetic 

beads (Dupont Magnasort-R) were added, and the incubation 
continued 15 minutes more. The beads were separated from 
the solution magnetically, similarly washed, and the DNA 
eluted from the beads in 1% SDS at 65 "C. The eluted DNA 

25 was ethanol precipitated and fractionated by 

electrophoresis in an agarose gel, which was dried and 
autoradiographed . 

Only the fragment containing the EcRE 
oligonucleotide was specifically and efficiently 

30 registered on the autoradiographs , and that registration 

was dependent upon the anti-EcR antibody. Quantitative 
analysis of the autoradiographs demonstrated a I0 3 -fold 
preference for binding to the EcRE oligonucleotide over 
the average vector sequences, under the conditions of 

35 this assay (see Methods, below) . 
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According to the criteria stated at the beginning of 
this Experimental Example, the EcR protein clearly 
satisfies the definition of an ecdysone receptor. 

Methods 

Conditi ons for the DNA binding assay 

A quantity of 0.2 fmole of digested, labelled 
plasmid DNA was mixed with 2 fig (dl/dC) in 10 pi of TE 
(10 mM Tris-HCl, pH 8.0, 1 mM EDTA) , and 90 fil of the 
MtEcRHy extract, diluted to 0.9 mg/ml in EcR buffer 
adjusted to 180 mM KC1, was added. After binding for 
15 minutes at 25°C, 2 ml of affinity-purified anti-EcR 
antibody, diluted 1.5x in EcR, was added, and this 
incubation was continued at 25 'C for 40 minutes, when 
50 fj.1 of anti-rabbit Ig-coated magnetic beads (Dupont 
Magnasort-R) , exchanged into 180 mM KC1 EcR buffer, was 
added and the incubation continued for 15 minutes. 

The beads were washed twice in 400 fil 180 mM KC1 EcR 
buffer, and DNA was eluted from the beads by soaking 
twice in 200 nl 1% SDS in TE at 65 'C. The eluted DNA was 
ethanol precipitated and run on an agarose gel, which was 
dried and autoradiographed. As controls, one half of the 
input DNA (0.1 fmole) was run on the gel for comparison, 
and the binding assay was carried out, leaving out the 
antibody . 

EXAMPLE IV 
RECEPTOR GENE MUTAGENESIS. 

Mutations in the steroid receptor superfamily genes 
can alter their function in two ways. Most obviously, 
they alter the sequences encoding the receptor proteins 
and thus alter the receptor function. Alternatively, 
they can alter the expression of these genes — an 
alteration that can be at any level of that expression 
from transcription of the gene to the translation of its 
mRNA(s) . Such mutations can change the timing of gene 
expression during development or change the tissue and 
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cell distribution of that expression, thus, profoundly 
changing the course of development. Furthermore, these 
mutations provide information about the regulation of 
receptor gene expression, just as mutations that alter 
5 the structure of the receptors encoded by these genes 

provide information about the genes whose expression 
these receptor proteins control. In particular, 
mutations that alter receptor gene expression can lead to 
the identification of the proteins and other regulatory 

10 molecules that control their expression. Clearly, 

mutagenesis of insect steroid receptor superfamily genes 
provides an important avenue leading to an ability to 
interfere in a highly specific manner with insect 
development, and thus to control insect infestations 

15 deleterious to human health and agriculture. 

We have carried out mutagenesis experiments for two 
Drosophila members of the steroid receptor superfamily 
genes, E75 and EcR, that we have cloned and characterized 
with respect to their expression. In this experimental 

20 example, mutagenesis of the E75 gene is described. 

A. Deletion Mutations 

In Drosophila , genetic analysis for a given locus - 
- in this case, the early puff locus at 75B that houses 

25 the E75 gene — generally depends upon the isolation of 

deletions of all or part of that locus. Such deletions 
greatly facilitate the subsequent isolation of point and 
other small mutations within the locus. By isolating 
mutations that are revertants to the neighboring dominant 

30 Wrinkled (W) mutations, we have isolated and molecularly 

mapped the boundaries within our chromosomal walk (see 
Experimental Example I) of two deletions, W R4 and W R1 °, 
generated by gamma ray mutagenesis, the preferred way of 
generating such large alterations of genomic structure. 

35 One of these, W R1 °, extends distally from Wrinkled to 

cover the entire E75 gene; and the other, W R4 , extends to 
a point about 90 kb upstream of the 5' end of the 50 kb 
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E75A transcription unit and does not include the E75 
gene. 

An F2 screen was then employed to screen for gamma 
ray-induced mutations mapping to the 200 kb distal region 
5 that is included in the W R1 ° deletion but not the W R4 

deletion. This screen resulted in the isolation of five 
members of a single lethal complementation group that 
molecular mapping data demonstrate represents the E75 
gene. The most useful of these five mutations is the 

10 E75 X 48 mutation. Molecular mapping of this mutation 

demonstrated that it is a 105 kb region that includes all 
of the E75 gene. This method provides an extremely 
efficient screen for other E75 mutations, i.e., by 
screening for mutations that cannot complement this 

15 deletion mutation. 

B. E75 Mutations Generated by Ethyl Methane 
Sulfonate 

20 

The chemical mutagen ethyl methane sulfonate (EMS) 
was used for this screen, as it is the preferred method 
for generating point or small mutations. An F2 screen of 
15,000 lines resulted in the isolation of 23 penetrant 

25 mutations within the 105 kb region of the E75 X48 deletion, 

all of which turned out to be alleles of E75 . It appears 
that this 105 kb region was saturated by this screen with 
respect to lethal complementation groups, and hence, E75 
appears to be the only lethal complementation group in 

3 0 this region. Adding the five E75 mutations described 

above, a total of 28 penetrant E75 alleles have thus been 
isolated, several of which are temperature-sensitive 
alleles. 

Inter se complementation studies among these alleles 
35 and examination of their phenotypes reveal a complex 

complementation group — a complexity that probably 
results from the fact that the E75 gene contains two 
overlapping transcription units, a 50 kb E75A unit and a . 
20 kb E75B unit that occupies the 3 1 end of the E75A unit 
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(see Experimental Example I and Table 1) . These alleles 
can be roughly divided into two groups: (1) those that 
cause lethality in early development, during the latter 
part of embryogenesis or during early larval development, 
5 and (2) those that cause lethality late in development, 

during the prepupal or pupal stages. 

This division correlates with the stages when the 
E75A and E75B units are expressed. Thus, E75A 
transcription is associated with each of the six pulses 

10 of ecdysone, including those that mark the embryonic and 

early larval stages. By contrast, E75B mRNAs are not 
observed until the end of the last larval stage, being 
particularly abundant during the pupal stage. This 
correlation invites the speculation that the early lethal 

15 mutations affect the expression of the E75A unit and its 

E74A protein, and that the late lethal mutations 
specifically affect the expression of the E75B unit and 
its E75B protein. This proposition can be tested by 
detailed molecular mapping of these mutations and further 

20 examination of their phenotypes at the molecular level to 

determine the causes of lethality. 

The mutants described here provide a foundation for 
the further genetic analysis of the E75 gene that will 
allow exploration of the reguirements for appropriate E75 

25 expression and function and will identify structural and 

functional domains of E75. Some of the future E75 
studies will best be performed by its in vitro 
manipulation, followed by transformation of the 
constructs back into Drosophila. Finally, it will be 

3 0 desirable to identify interacting genetic loci — 

interactions that may occur at the level of regulation of 
E75 expression or at the level of interaction of the E75 
proteins with those encoded by other genes. Such 
interactive genetic loci can be identified via the 

35 isolation of mutations that act as suppressors or 

enhancers of the E75 mutations. 
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Methods 

Strains, markers and chromosomes 

For this aspect of the invention, the following 
strains, markers and chromosomes were used. Tu 2 was 
described by Lindsley (Lindsley, 1973. DIS 50:21). All 
other strains and mutations are as described (Lindsley, 
and Grell, 1968. Genetic Variation of Drosophila 
melanocraster . Publication 627, Carnegie Institute of 
Washington, Washington, DC) . ru h W R4 e s ro ca was 
constructed by recombination between ru h W R4 sbd 2 Tu 2 and 
st sbd 2 e s ro ca. The st in ri p_ p sbd 2 chromosome was 
constructed by recombination of st in ri e p with sbd 2 , in 
order to allow marking of this chromosome over W R4 and 
W R1 °, and homozygosed by crossing to TM3, backcrossing to 
TM3, and mating of isogeneic sibling progeny. The st p_ p 
ell line was homozygosed by standard ionic procedures. 
Anto w and ns Rc4 are described in Scott et al. (1984) 
Proc. Nat' l Acad. Sci. USA 81:4115-4119. The pupal 
lethals X19, g26, 013B . 8ml2 . iX-14 . 2612 . m45 . p4., a30L . 
mz4l6, 13mll5 , 052 and wcf49 are described in Shearn 
(1974) Genetics 77:115-125. All strains used to 
construct the strains described above and other strains 
were obtained from the Bowling Green and Caltech stock 
centers . 

TM1, TM3 and TM6B (Lindsley, and Grell, 1968. 
Genetic Variation of Drosophila melanogaster F Publication 
627, Carnegie Institute of Washington, Washington, DC) 
are balancer chromosomes carrying recessive lethal 
mutations along with multiple inversions to suppress 
recombination. This allows the maintenance, as a 
heterozygote, of a recessive lethal chromosome in its 
original state. These chromosomes are also marked with 
convenient visible markers. 



Quantitative Southern blot mapping for detection of 



mutant lesions 
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DNA was prepared from adult flies (about 50) by 
douncing in 1 ml of 10 mM Tris-HCl, pH 7.5, 60 mM NaCl, 
10 mM EDTA, 0.15 mM spermine, 0.2 mg/ml proteinase K. 
The homogenate was added to an equal volume of 0.2 M 
5 Tris-HCl, pH 9.0, 30 mM EDTA, 2% SDS, 0.2 mg/ml 

proteinase K, incubated at 37 °C for 1 hour, and then 
extracted twice with buffer-saturated phenol and once 
with 24:1 chloroform/isoamyl alcohol. DNA was EtOH 
precipitated twice, hooking the pellet out without 

10 centrifugation. Southern blot hybridization was as 

described (Segraves, W. et al., 1984. J. Mol. Biol. 
175:1-17). Where restriction fragment length 
polymorphism was not used in order to distinguish the 
parental chromosome from the balancer chromosome, 

15 quantitation of band intensity on genomic Southerns was 

achieved using a scanning densitometer. By using a 
control probe outside the mutant region, the amount of 
DNA in each track was internally controlled. Comparison 
of deficiency heterozygote to wild type bands, when 

20 normalized to a control band in this way, gives little 

deviation from the expected 1:2 ratio. 

Molecular cloning of mutant lesions 

Restriction fragments of the appropriate size were 

25 isolated by preparative low melting agarose (FMC) 

electrophoresis of about 2 0 /ig of restricted genomic DNA. 
The 6 kb W R4 Xhol fragment was cloned into Xhol-cleaved 
ASE6DBam which is propagated as a plasmid in order to 
grow the vector and cannot be packaged without an insert. 

30 The 18 kb W R1 ° Sail fragment was cloned into the Sail site 

of AEMBL3, cleaved also with EcoRI for the biochemical 
selection method for the prevention of propagation of 
non-recombinant clones. The 7 kb EcoRI fragment 
containing the x37 breakpoint was cloned into EcoRI- 

35 cleaved A 6 07. Plating of recombinants on the hflA strain 

RY1073 prevented plaque formation by non-recombinant 
phage. The 14 kb x48 EcoRI fragment was cloned into the 
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EcoRI site of AEMBL4 , which had been cleaved with BamHI 
to utilize the "biochemical selection" for recombinants. 
The breakpoint fragments of x44 and the recipient 
fragment were cloned into ASE6ABam. Libraries were 
packaged using A in vitro packaging extracts prepared as 
described in Hohn (Hohn, B. , 1979. Methods Enzymol. 
68:299-303). After demonstration that each of the 
libraries gave a significant number of plaques only when 
inserts were included in the ligation, they were screened 
using restriction fragments capable of detecting the 
breakpoint clones. 

Gamma ray mutagenesis 

Adult males of the strain ru h W sbd 2 Tu 2 or st in ri 
p_ p sbd 2 were irradiated in plastic vials with 5000 rad of 
gamma rays from a Cs 137 source at a dose rate of 4300 
rad/minute. These were then mated to virgins of the 
appropriate strain, which were allowed to lay eggs for 
five days. 

EMS mutagenesis 

The primary lesion in EMS-induced mutations of 
bacteria and yeast is an alkylation-induced transition of 
guanine to adenine; most EMS-induced point mutations in 
Drosophila can similarly be explained on this basis. 
This change would be expected to convert, on the 
complementary strand, a C in the opa repeat element to a 
T, creating an in-frame stop codon (CAGCAA to UAGCAA or 
CAGUAA) . (Ethylnitrosourea, ENU, which has been reported 
to yield a higher number of mutations for a given amount 
of sterility, is also an alkylator; however, considerably 
more stringent precautions must be taken in handling this 
mutagen . ) 

EMS was administered at 0.025 M to unstarved 1.5-5 
day-old males in 1% sucrose solution (1.5 ml on two slips 
of Whatman #2 in a 350 ml milk bottle) . Starvation of 
the males for 8 hours before EMS administration resulted 
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in unacceptable levels of sterility, and males of the st 
E p strain readily fed upon the EMS/sucrose solution 
without starvation. Mutagenesis was monitored by 
crossing mutagenized males to attached-X FMA3 females. 
5 Other mutants seen in this screen included a large number 

of ca alleles (many mosaic) seen over TM6B in the Fl and 
F2 generations, a dominant brown allele, and two new 
mutants, Wink , a third chromosome dominant mutation 
resembling Bar , and a third chromosome dominant Curlv - 

10 like mutation. Wink is easily scored (RK1) , has complete 

penetrance, and is quite healthy over TM6B. 

In the initial screen, vials were scored as mutant 
if they had fewer than 25% as many deficiency 
heterozygote as balancer heterozygote flies. On 

15 retesting, this was revised to 50% of the level seen in 

control crosses. Balancer heterozygotes were 
approximately two-thirds as viable as deficiency 
heterozygotes . 

20 In situ hy bridization and cvtological analysis 

In situ hybridization of polytene chromosomes was 
carried out as described in Experimental Example I (see 
Methods, section A) . Cytological analysis was performed 
by squashing larval salivary glands in lactoacetic orcein 

25 (2% orcein, 50% acetic acid, 30% lactic acid) . 

Although the present invention has been described in 
some detail by way of illustration and example for 
purposes of clarity of understanding, it will be obvious 
that certain changes and modifications may be practiced 

3 0 within the scope of the claims. 
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ACTTACTAGTGAAAAACATCATAATAAACA^TTGCCAAAAAAAATCCAATGAAATTGACA 



CTTATCTTAAAAAAATAGGTGAGATTGTAACCGTTGATGTAC^^ 



ACTGATTTCGJS&GiaGG^C^TCC^ 
TGCGTEGATGGCAGCACaTGTTCGAAGT CCC^^ 



ATCCTCGGCTCCGCTTGTTCCCTTCC^ 



TTTAAQCAGTGTAGTGAGTGCCCCSTGAAAAQGATAACCCAAAAAC^ 

*211 

TTTTATCAGTCvrXjAAGAAAACAjrg.TAAACTTGGCTCAAAAAGG^ 



GAAGGATAAAATAATATCGCACCAGTCCTTCAAAAArc 



AAATTTTTTTAATAAATATTCCAAAAATATTCCCC^^ 
380 

' ATC TTA ATG TCC GOG GAC AGT TCA GAT AGC GCC AAG ACT TCT GTG ATC TGC AGC 
MET Leu MET Ser Ala Asp Ser Ser Asp Ser Ala Lys Thr Ser Val lie Cys Ser 

ACE GTG AGT GCC AQC ATG CTA GCA CCA CCA GCT CCA GAA CAG CCC AGC ACC ACA 
Thr Vai Sor Ala Ser MET Lou Ala Pro Pro Ala Pro Glu Gin Pro Ser Thr Thr 



GCA CCA CCC ATT TTG GGG GTA ACA GGT CGA TCT CAC CTG GAA AAT GCC CTG AAA 
Ala Pro Pro lio Lou Giy Val Thr Gly Arg Sor His Lau Glu Asn Ala Lou Lys 



CTA CCG CCA AAC ACA AGT GTT TCG GCT TAC TAC CAG CAC AAC AGC AAG CTG GGC 
Leu Pro Pro Asn Thr Ser Val Sar Ala Tyr Tyr Gin His Asn Ser Lys Leu Giy 



ATG GGC CAG AAT TAC AAT CCG GAA TTC AGG AGC CTG GTA GCA CCT GTC ACA GAT 
MET Giy Gin Asn Tyr Asn Pro Glu Phe Arg Ser Leu Val Ala Pro Val Thr Asp 



CTG GAT ACT GTG CCA CCC ACA GGT GTG ACC ATG GCG AGT TCT TCG AAT TCT CCC 
Leu Asp Thr Val Pro Pro Thr Gly Val Thr MET Ala Ser Ser Ser Asn Ser Pro 

108 

AAC TCC TCC GTC AAG CTG CCC CAC AGC GGC GTG ATC TTT GTC AGC AAA TCG AGT 
Asn Ser Ser Vai Lys Lou Pro His Sar Gly Val lie Phe Val Sor Lys Sor Sar 



GCC GTC AGC ACC ACC GAT GGT CCC ACT GCA GTG TTG CAA CAG CAG CAG CCG CAG 
Ala Val Sor Thr Thr Asp Gly Pro Thr Ala Val Lau Gin Gin Gin Gin Pro Gin 



CAG CAA ATG CCC CAG CAC TTC GAG TCC CTG CCC CAC CAC CAC CCC CAG CAG GAA 
Gin Gin MET Pro Gin His Phe Glu Ser Lou Pro His His His Pro Gin Gin Glu 



AAAACTGA^t^AG^TCCGCCGCGAATGTTGATTTTC^ 




812 
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CAG CAA ATG CCC CAG CAC TTC GAG TCC CTG CCC CAC CAC CAC CCC CAG CAG -AA 

Gin Gin MET Pro Gin His Pho Glu Ser Lou Pro His His His Pro Gin Gin Glu 

CAC CAG CCA CAG CAG CAG CAG CAA CAA CAT CAC CTT CAG CAC CAC CCA CAT CCA 

His Gin Pro Gin Gin Gin Gin Gin Gin His His Lou Gin His His Pro His Pro 



CAT GTG ATG TAT CCG CAC GGA TAT CAG CAG GCC AAT CTG CAC CAC TCG GGT GGT 
His Val MET Tyr Pro His Giy Tyr Gin Gin Ala Asn Leu His His Sor Gly Gly 



ATT GCT GTG 
Ilo Aia Val 



GTT COG GCG GAT TCG CGT 
Val Pro Ala Asp Ser Arq 



CCC CAG ACT CCC GAG TAC ATC AAG TCC 
Pro Gin Thx Pro Glu Tyr He Lys Ser 
216 



TAC CCA GTT 
Tyr Pro Val 



ATG GAT ACA ACT GTG GCT 
MET Asp Thr Thr Val Ala 

■lntron 1 . . TTCTTTGCAG 



1082 Y 
AAC ATA GAA 
Asn lie Glu 



TCC GGT TTC 
Sor Gly Pho 



TTC GAT GGC ACC ACA GTG 
Pho Asp Gly Thr Thr Val 



CAT TAC GGC GTG CAT TCC 
His Tyr Gly Val His Ser 



AGT TCG GTA AAG GGG GAA CCA GAA CTC 
Sor Ser Val Lys Gly Glu Pro Glu Lou 



CTG TCC CCC GTT TGC CGG GAT AAG GCC 
Lou Cys Arq Val Cys Gly Asp Lys Aia 



GTAAGTTCGT. .int 



^ron ^2. . ATCGTT TCAG 



TGC GAG GGT TGC AAG GGA TTC TTC CGC 
Cys Glu Giy Cys Lys Giy Pho Pho Arq 



CCC TCC ATC 
Arq Sor Ilo 



CAG CAA AAG ATC CAG TAT 
Gin Gin Lys He Gin Tyr 



CGC CCG TGC ACC AAG AAT CAG CAG TGC 
Arq Pro Cys Thr Lys Asn Gin Gin Cys 



AGC ATT CTG CGC ATC AAT CGC AAT CGT TGT CAA TAT TGC CCC CTG AAA AAG TGC 
Sor lie Lou Arq lie Asn Arq Asn Arq Cys Gin Tyr Cys Arq L»u Lya Lys Cys 

G TGAGTACC T . . lntron 3 . . CCAATTGCAG 



V 

ATT GCC GTG GGC ATG AGT CGC GAT GCT GTG CGT TTT GGA CGC GTG CCG AAG CGC 

lie Aia Val Giy MET Sor Arq Asp Ala Val Arq Pho Giy Arq Val Pro Lys Arq 

1 352 

CAA AAG GCG CGT ATC CTG GCG GCC ATG CAA CAG AGC ACC CAG AAT CGC GGC CAG 

Giu Lys Ala Arq lio Lou Ala Ala MET Gin Gin Ser Thr Gin Asn Arq Gly Gin 

CAG CGA GCC CTC GCC ACC GAG CTG GAT GAC CAG CCA CGC CTC CTC OCC GCC GTG 

'Jin Arq Ala Lou Ala Thr Glu Lou Asp Asp Gin Pro Arq Lou Leu Ala Ala Val 

CTG CGC GCC CAC CTC GAG ACC TGT GAG TTC ACC AAG GAG AAG GTC TCG GCG ATG 

Leu Arq Ala His Lou Giu Thr Cys Giu Pho Thr Lys Giu Lys Val Sor Ala MET 



g TAAGTCTCA. .1 



r.trcn 4 . . ATTTC' 



CGG CAG CGG GCG CGG GAT TCC CCC TCC TAC TCC ATG CCC ACA CTT CTG GCC TGT 
Arq Gin Arq Ala Arq Asp Cys Pro Sor Tyr Ser MET Pro Thr Lou Leu Ala Cys 
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CCS CTG AAJ u~C GCC CCT GAA~ CTG CAA 7CG GAG LAG GAG TTC T~C CA 

Pro Leu Asn Pro Ala Pro Glu i« Gin Ser Giu Gin Glu PfM Ser c7n £0. Pm 
1622 

C^ CAC GTA ATT «E «T GTG ATC GAC TTT CCC GCC ATG A" CC m- — ~»r 
Ala Hi, VI II. An, Gly Vai „. Asp Phe AU Gly ^ J£ «J — ^ 



432 



CTG CTC ACC CAG GAC GAT AAG TTC ACG CTC CTG AAG GCG GGA CTC — ~, 

Leu Thr Gin Asp Asp Lys Ph. Thr Leu Leu Lys Ala G.y S PnQ £ ^ 

CTG TTT GTG COC CTG ATC TGC ATG TTT GAC TCG TCG ATA AAC -r» t ^ w w 
Leu Phe Va, An, Leu lie Cys MET Pne Asp Ser IS J2 £n S JS £ SI 

CTA AAT GOT CAG GTG ATG CGA COS GAT GCG ATC CAG AAC GGA GCC AAT rr 

"° U AS " C * Gin Vai ** Arg Arg Asp Ala lie Gin Asn Gly Aia Asn Ala Arg 



486 



JS 2 51? £ E £ ™ K S S S 5? 25 2 S IE 2 



1892 



ACA GAT GCC GAG ATA GST CTG TTC TGC GCC ATC GTT CTG ATT ACG CCG GA- — 
-hr Asp Ala Glu lie Gly Leu Phe Cys Ala lie Val Leu He Thr Pro Aso £g" 

CCC GGT TTG CGC AAC CTG GAG CTG ATC GAG AAG ATG TAC TCG CGA CTC AAG GGC 
Pro Gly Leu Arc, Asn Leu Glu Leu lie Glu Lys MET Tyr Ser Arg Leu Lys 

540 

TGC CTG CAG TAC ATT GTC GCC CAG AAT AGG CCC GAT CAG CCC GAG TTC CTG GC<- 
Cys Leu Gin Tyr He Vai Ala Gin Asn Arg Pro Asp Gin pS Glu £u Ea 

AAG TTG CTG GAG ACG ATG CCC GAT CTG CGC ACC CTG AGC ACC CTG CAC AC- CAC 
Lys Leu Leu Glu Thr MET Pro Asp Leu Arg Thr Leu Ser £r £s S Su 

AAA CTG GTA GTT TTC CGC ACC GAG CAC AAG GAG CTG CTG CGC CAG CAC ATr 
Lys Leu Val Val Phe Arg Thr Glu His Lys Giu Leu Leu Arg Gin Gin MET Tro 
2162 594 

S« mS SS f C J* AGC ^ ^ CAG CAG AAC AAG TCG CCC TCG C^C 

Ser MET Glu Asp Gly Asn Asn Ser Asp Gly Gin Gin Asn Lys Ser Pro Ser 

a?* AAG AGT CCG CT~ GGC T~G —A 

S«r Trp Ala Asp Ala MET Asp Val Glu Ala Ala Lys Ser Pro Leu Gly S Val 

c^" GGC AGT CCG AGC AGT TCG CAG CCA 

Ser Ser Thr Glu Ser Ala Asp Leu Asp Tyr Gly Ser Pro Ser Zr Se~r Sn So 



648 



Gin G\v ul? CCS CAG CAA CAG CCC TCG CTG GCC AGC 

Gin Gly Val Ser Leu Pro Ser Pro Pro Gin Gin Gin Pro Ser Ala Leu Ala Ser 

TCG GCT CCT CTG CTG GCG GCC ACC CTC TCC GGA GGA TGT CCC CTG CGC AAC CGC 

Ser Ala Pro Leu Leu Ala Ala Thr Leu Ser Gly Gly Cys Pro Leu Arg Asn Arg 
2432 

GCC AAT TCC GGC TCC AGC GGT GAC TCC GGA GCA GCT GAG ATG GAT ATC GTT GCT 

Ala Asn Ser Gly Ser Ser Gly Asp Ser Gly Ala Ala Glu MET Asp JS Si Sy" 



702 



97 
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2432 

GCC AAT TCC GGC TCC AGC GGT GAC TCC CGA GCA GCT GAG ATG GAT ATC GTT CGC 
Ala Asn Sor Gly Ser Ser Gly Asp Sor Gly Ala Ala Giu MET Asp He val Gly 



TCG CAC GCA CAT CTC AOC CAG AAC GGG CTG ACA ATC ACG CCG ATT GTG CGA CAC 
Ser His Ala His Lou Thr Gin Asn Gly Leu Thr He Thr Pro He Val Arg His 

GJ AGTATCTT. . lr .t ro n 5 . . TTTCTTAC ^S 

NT 

CAG CAG CAG CAA CAA CAG CAG CAG CAG ATC GGA ATA CTC AAT AAT GCG CAT TCC 
Gin Gin Gin Gin Gin Gin Gin Gin Gin He Gly He Leu Asn Asn Ala His Ser 



CGC AAC TTG AAT GGG GGA CAC GCG ATG 
Arg Asn Leu Asn Gly Gly His Ala MET 



CTG CAC CAC CAC TTG ACA GCC GGA GCT 
Leu His His His Leu Thr Ala Gly Ala 

2702 

ACG GAT TOG GGC ATT GAG TOG GGC AAC 
Thr Asp Ser Gly He Glu Ser Gly Asn 



TGC CAG CAA CAG CAG CAG CAC CCA CAA 
Cys Gin Gin Gin Gin Gin His Pro Gin 
756 

G (Dm4 9Z5) 

GCC CGC TAG AGA AAG CTA GAT TCG CCC 
Ala Arg Tyr Arg Lys Lau Asp Ser Pro 
Arg 

GAG AAG AAC GAG TGC AAG GCG GTG ACT 
Glu Lys Asn Giu Cys Lys Ala Val Ser 



TCG GGG GGA AGT TCC TCG TGC TCC AGT CCG CGT TCC AGT GTG GAT GAT GCG CTG 
Ser Gly Gly Ser Ser Ser Cys Ser Ser Pro Arg Ser Ser Val Asp Asp Ala Leu 

810 

GAC TGC AGC GAT GCC GCC GCC AAT CAC AAT CAG GTG GTG CAG CAT CCG CAG CTG 
Asp Cys Ser Asp Ala Ala Ala Asn His Asn Gin Val Val Gin His Pro Gin Leu 



AGT GTG GTG TCC GTG TCA CCA GTT CGC TCG CCC CAG CCC TCC ACC AGC AGC CAT 
Ser Val Vai Ser Val Ser Pro Val Arg Ser Pro Gin Pro Ser Thr Ser Ser His 



CTG AAG CGA CAG ATT GTG GAG GAT ATG CCC GTG CTG AAG CGC GTG CTG CAG GCT 
Leu Lys Arg Gin He Val Glu Asp MET Pro Vai Leu Lys Arg Val Leu Gin Ala 

S64 

2972 

CCC CCT CTG TAC GAT ACC AAC TCG CTG ATG GAC GAG GCC TAC AAG CCG CAC AAG 
Pro Pro Leu Tyr Asp Thr Asn Ser Leu MET Asp Glu Ala Tyr Lys Fro His Lys 



AAA TTC CCG GCC CTG CCG CAT CGC GAG TTC GAG ACC GCC GAG CCG GAT GCC AGC 
Lys Phe Arg Ala Leu Arg His Arg Glu Phe Giu Thr Ala Glu Ala Asp Ala Sor 



AGT TCC ACT TCC GGC TCG AAC AGC CTG AGT GCC GGC AGT CCG CGG CAG AGC CCA 
Ser Ser Thr Ser Gly Ser Asn Ser Leu Ser Ala Gly Ser Pro Arg Gin Ser Pro 

918 

GTC CCG AAC AGT GTG GCC ACG CCC CCG CCA TCG GCG GCC AGC GCC GCC GCA GGT 
Val Pro Asn Ser Val Ala Thr Pro Pro Pro Ser Ala Ala Ser Ala Ala Ala Gly 



AAT CCC GCC CAG AGC CAG CTG CAC ATG CAC CTG ACC CGC AGC AGC CCC AAG GCC 
Asn Pro Ala Gin Ser Gin Leu His MET His Leu Thr Arg Ser Ser Pro Lys Ala 
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3242 

TCG ATG GCC AGC TCG CAC TCG GTG CTG GCC AAG TCT CTC ATG GCC GAG CCG CCC 
Ser MET Ala Ser Ser His Ser Val Leu Aia Lys Ser Leu MET Ala Giu Pro Are 

972 

ATG ACG CCC GAG CAG ATG AAG CGC AGC GAT ATT ATC CAA AAC TAG TTG AAG CGC 
MET Thr Pro Glu Gin MET Lys Arq Sar Asp He He Gin Asn Tyr Leu Lys Arq 



GAG AAC AGC ACA GCA GCC AGC AGC ACC ACC AAT GGC GTG GGC AAC CGC AGT CCC 
Glu Asn Ser Thr Ala Ala Ser Ser Thr Thr Asn Cly Vai Gly Asn Arq Ser Pre 



AGC AGC AGC TCC ACA CCG CCG CCG TCG GCC GTC CAG AAT CAG CAG CGT TCG GGC 
Ser Ser Ser Ser Thr Pro Pro Pro Ser Ala Val Gin Asn Gin Gin Arq Trp Gly 

1026 

AGC AGC TCG GTG ATC ACC ACC ACC TGC CAG CAG CGC CAG CAG TCC GTG TCG CCG 
Ser Ser Ser Val He Thr Thr Thr Cys Gin Gin Arq Gin Gin Ser Val Ser Pro 

3512 

CAC AGC AAC GGT TCC AGC TCC AGT TCG AGC TCT AGC TCC AGC TCC AGT TOG TCA 
His Ser Asn Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser 



TCC TCC TCC ACA TCC TCC AAC TGC AGC TCC AGC TCG GCC AGC AGC TGC CAG TA? 
Ser Ser Ser Thr Ser Ser Asn Cys Ser Ser Ser Ser Ala Ser Ser Cys Gin Tyr 

1083 

TTC CAG TCG CCG CAC TCC ACC AGC AAC GGC ACC AGT GCA CCG GOG AGC TCC AGT 
Phe Gin Ser Pro His Ser Thr Ser Asn Gly Thr Ser Ala Pro Ala Ser Ser Ser 



TCG GGA TCG AAC AGC GCC ACG CCC CTG CTG GAA CTG CAG GTG GAC ATT GCT GAC 
Ser Gly Ser Asn Ser Ala Thr Pro Leu Leu- Glu Leu Gin Val Asp He Ala Asp 



TCG OCG CAG OCT CTC AAT TTG TCC AAG AAA TCG CCC ACG CCG CCG CCC AGC AAG 
Ser Aia Gin Pro Leu Asn Leu Ser Lys Lys Ser Pro Thr Pro Pro Pro Ser Lys 

3782 

CTG CAC GCT CTG GTG GCC GCC GCC AAT GCC GTT CAA AGG TAT CCC ACA TTG TCC 
Leu His Aia Leu Val Aia Ala Aia Asn Ala Val Gin Arq Tyr Pro Thr Leu Ser 



GCC GAC GTC ACA GTG ACA GCC TCC AAT GGC GGG TCC TCC GTC GGC GGC GGC GAG 
Aia Asp Val Thr Val Thr Ala Ser Asn Gly Gly Ser Ser Val Gly Gly Gly Glu 



TCC GGC CGC CAG CAG CAG TCC GCC GGC GAG TGT GGG CTC CCC CAA TCC GGG CCT 
Ser Gly Arq Gin Gin Gin Ser Ala Gly Glu Cys Gly Leu Pro Gin Ser Gly Pro 

1186 

GAG CGC CGC CGT GCA CAA GGT AAT GCT GGA GGC GTA AGA GCC GGA GGA GGT AGG 
Glu Arq Arq Arq Ala Gin Gly Asn Ala Gly Gly Val Arq Ala Gly Gly Giy Arq 



TGG TTT TAC OCG GAG AAG TGG GAG AGA CAG AGA CTG GGA GTG GCA GTT CAG CGA 
Trp Phe Tyr Ala Glu Lys Trp Glu Arq Gin Arq Leu Gly Val Ala Vai Gin Arq 

4052 

AGC AGG AAG CAG GAT CAC TTG GAG CGG CGG GAG TTG AAT TAA 
Ser Arq Lys Gin Asp His Leu Glu Arq Arq Glu Leu Asrs 

1237 
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4052 

AGC AGG AAG CAG GAT CAC TTG GAG CGG CGG GAG TTG AAT TAA 
3ar Axg Lya Gin Asp Ria Lou Giu Arg Arg Giu Leu Asn^ . 

ATTATTTTACCATT7AATTCAGACGTCTACAAAGTTTG 

7AATATTTAAAGCAACAACAAACAAAACAACTACAAGTTATTAATTTAAAA 
4234 

AACAAAAAACOC AAGCTTGAATGGTA TTAC AAAAGAAAAAGAAAAACAGAAAAAATA TAAATATA TT TTA 

GCAGTTAAACTTTAACGTAGCAAGAAACCAACAAACCCAAGGCAGCGCTCTGATTTCGCATTAA^ 

4374 _ 

TTCAGCTGCTACCGAAAAOCXrCCTCACCTCCCCGCCACCCAACCCT 

CXjACCCCTGATTGTTTTATAAGTTTTAAGCTCTTGTTGTACAT^ 
4514 

TTAGCGCTTTAGTTGTAGTTOGAGCA^ 

-ATTATT ATTAAA TTTTTAAATACCTAAAAACAAAACA^^ 

| ^ poly A alto c£)m4927 and cDm4928 

4654 

AAGCAGAA TGAAGTGCAG TTTGCAACAAATTTTAACTACGAT TAAGTTGAT AACGAT TCAT TTTT TA TGA 

ATTTAACTAATTTTATGAATTTCTTATAGTTTTCCACCCT?CTATAGATCTTC?ATC 
4794 

^CCCCTATTCCTCATTTCTCCTTTGGCACA^ 

rGTTrTCTAATTTTAAAACTACCACAAAAATACGATTAAAATATACACG^ 
-934 

ATGCTTAAAGTTTTAGCAOCAAGCAGTAAAACGACGATC^^ 

TTGTCTrACATAGTTAAATGTTAAATTAAACACAAAAACATATTTAAA 
5074 

TTAATGAAGAAACXTTATGCTTAAAAGATTCAATGTTTGATTGGC^ 

AAAAAAAAATCAAC AAACAAAAAT TATGAT ATAT TA TT TAAAAGTAAAGTA TACA TTTACATT AC AGAAA 
5214 

AACAAAAGAGAAAACTTGCGGTAGCAACAAAACTAT TA TA TT AA TTACA TTTT AA TTA TCXT TG TACT AT T 

ATGATTATTAATTATTATGAlTAATTAATTACGATTm 
5354 

GCAAAAACCAXTJAAAAAAAAAAACAAAAAACAAGCAAAAAAT 

putativo polyaaenylatlon signal fcr long cranscripta 
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CGACCCGTTTC<»GTGAACCT^TCAGTTGGCAC^C^ TC 
-Ml 

CCTCGGCCGAGTACGAGCGGCATGO^TAATraXZ^^ 
ACi^^CCCGAAGCAACACAACACCAACACGAG^ 

1 ^ rrRNA scan sice 

TCCAGAGCAAGAAAAGCCCCCGAAAAACCAAGCAAAAA^ 

AGAGCAGAGTCAACAGCAACAAATCTTCAATAOCAAATGATATCG^ 

284 

GTG TCATTTCGAAGCCAAAAAGCAAAATCTCTAATTCAAAT ATG GTT TGT GCA ATG CAA 

MET Val Cys Ala MET C.r 

3C2 

GAG GTT GCT GCT GTG CAG CAT CAG CAG CAG CAA CAG CAA CTC CAG TTG CCC CAG 
Giu Val Ala Ala Val Gin His Gin Gin Gin Gin Gin Gin Lau Gin Leu Pro Gin 

24 

CAG CAA CAG CAG CAG CAG CAG ACA ACA CAG CAG CAA CAT GCA ACA ACG ATA GTG 

Gin Gin Gin Gin Gin Gin Gin Thr Thr Gin Gin Gin His Ala Thr Thr lie Vai 



CTC CTC ACG GGC AAT GGC GGC GCT AAT CTG CAC ATT GTC GCC ACA CCG CAA CAG 
Lou Lau Thr Giy Asn Gly Gly Giy Asn Lou His lie Val Ala Thr Pro Gin Gl-r. 



CAT CAG CCG ATG CAT CAG CTC CAC CAT CAG CAT CAG CAT CAG CAT CAG CAC CAG 
His Gin Pro MET His Gin Leu His His Gin His Gin His Gin his Gin His Glr. 

76 

CAG CAG GCC AAG AGC CAA CAG CTG AAG CAA CAA CAC TCG GCG CTG GTC AAG TTG 
Gir. Gin Ala Lys Ser Gin Gin Leu Lys Gin Gin His Ser Ala Leu Vai Lys Lq-j 

572 

CTG GAG TCG GCG CCC ATC AAG CAG CAA CAG CAG ACG CCC AAG CAA ATT GTT TAC 
Leu Giu Ser Aia Pro lie Lys Gin Gin Gin Gin Thr Pro Lys Gin lie Vai Tyr 



CTG CAG CAG CAG CAG CAG CAA CCG CAA CCC AAA AGA CTG AAA AAC GAA GCA GCA 
Leu Gin Gin Gin Gin Gin Gin Pro Gin Arg Lys Arg Leu Lys Asn Giu Ala Aid 

132 

ATC GTA CAA CAG CAA CAA CAA ACA CCT GCA ACA CTA GTA AAG ACA ACA ACC ACC 
lie Val Gin Gin Gin Gin Gin Thr Pro Ala Thr Leu Val Lys Thr Thr cnr Thr 



AGC AAC AGC AAC AGC AAC AAC ACC CAG ACA ACA AAT ACT ATT AGT CAG CAG CAA 
Ser Asn Ser Asn Ser Asn Asn Thr Gin Thr Thr Asn Sor I*e Ser Gin Gin Gi.r. 
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AGC AAC AGC AAC AGC AAC AAC ACC CAG ACA ACA AAT AGT ATT AGT CAG CAG CAA 
Ser Asn Ser Asn Ser Asn Asn Thr Gin Thr Thr Asn Ser I-e Ser Gin Gin Gin 



CAG CAG CAT CAG ATT CTG TTG CAG CAC CAG CAG CCA GCC GCG GCA OCA ACA CCA 
Gin Gin His Gin lie val Leu Gin His Gin Gin Pre Ala Aia Ala Ala Thr Pro 

18 6 

842 

AAG CCA TGT GCC GAT CTG AGC GCC AAA AAT CAC AGC GAG TCC GGC ATC GAC GAG 
Lys Pro Cys Ala Asp Leu Ser Ala Ly3 Asn Asp Ser Giu Ser Giy lie Asp Giu 



GAC TCC CCC AAC AGC GAT GAG GAT TCC CCC AAT GCC AAC CCC GCG GGC ACA TCG 
Asp Cys Pro Asn Ser Asp Glu Asp Cys Pro Asn Ala Asn Pro Ala Giy Thr Ser 



CTC GAG GAC AGC AGC TAG GAG CAG TAT CAG TGC CCC TGG AAG AAG ATA CCC TAT 
Leu Glu Asp Ser Ser Tyr Glu Gin Tyr Gin Cys Pro Trp Lys Lys lie Arq Tyr 

240 

COG CGT GAG CTC CTC AAG CAG CGC GAG TTG GAG CAG CAG CAG ACC ACC CGA GGC 
Ala Arq Glu Leu Leu Lys Gin Arg Glu Leu Giu Gin Gin Gin Thr Thr Giy Giy 



AGC AAC GCG CAG CAG CAA GTC GAG GCG AAG CCA GCT GCA ATA CCC ACC AGC AAC 

Ser Asn Ala Gin Gin Gin Val Glu Ala Lys Pro Ala Aia lie Pro Thr Ser Asn 

1112 

ATC AAG CAG CTG CAC TGT GAT AGT CCC TTT TCG GCG CAG ACC CAC AAG GAA ATC 

lie Lys Gin Leu His Cys Asp Ser Pro Pho Ser Ala Gin Thr His Lys Giu lie 



GCC AAT CTC CTG CGC CAA CAG TCC CAG CAA CAA CAG GTT GTG GCC ACG CAG CAG 
Ala Asn Leu Leu Arq Gin Gin Ser Gin Gin Gin Gin Vai Val Aia Thr Gin Gin 



CAG CAG CAA CAG CAG CAG CAG CAC CAG CAC CAG CAA CAA CGA AGG GAT AGC TCC 
Gin Gin Gin Gin Gin Gin Gin His Gin His Gin Gin Gin Arq Arq Asp Ser Ser 



GAC AGC AAC TGC TCG CTG ATG AGC AAC TCG AGC AAC TCC AGT CCG GGC AAT TGT 
Asp Ser Asn Cys Ser Leu MET Ser Asn Ser Ser Asn Ser Ser Ala Giy Asn Cys 



~OC ACC TGC AAC GCT GGC GAC GAC CAG CAG CTG GAG GAG ATG GAC GAG GCC CAC 

Cys Thr Cys Asn Ala Giy Asp Asp Gin Gin Leu Giu Glu MET Asp Giu Ala His 

1382 

GAT TCG GGC TGC CAC GAT GAA CTT TGC GAG CAG CAT CAC CAG CGA CTG GAC TCC 

Asd Ser Giy Cys Asp Asp Giu Leu Cys Giu Gin His His Gin Arg Leu Asp Ser 



TCC CAA CTG AAT "AC CTG TGC CAG AAG TTC GAT GAG AAA CTG GAC ACG GCG CTG 
.Ser Gin Leu Asn Tyr Leu Cys Gin Lys Phe Asp Giu Lys Leu Asp Thr Ala Leu 



AGC AAC AGC AGC GCC AAC ACG GCG AGG AAC ACG CCA OCT GTA ACA GCT AAC GAA 
5sr Asn Ser Ser Aia Asn Thr Giy Arg Asn Tnx Pro Aia Val Thr Aia Asn Giu 

1544 

GAT GCC GAT graggttnag 
Asp Ala Asp 



5 sss^^sss^^ssss^sssnsss^^^sssssss^ 



1 SSSSS2SSKSSJ5£SSSSS;SSSSSSSSSSSS" 



1 asssaaisggggasggasagggss aigsgisggs 
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WHAT IS CLAIMED IS : 

1. An isolated recombinant nucleic acid which, 
upon expression, is capable of coding for other than a 
native vertebrate steroid receptor or fragment thereof, 
said nucleic acid comprising a segment having a sequence 
substantially homologous to a coding region of domains A, 
B, D, E or F from an insect steroid receptor super family 
member gene having substantial homology to a steroid 
binding domain. 

2. An isolated recombinant nucleic acid of 

Claim 1, wherein said insect steroid receptor superfamily 
member is EcR, DHR3 , E75A or E75B. 

3. An isolated recombinant nucleic acid of 
Claim 1, wherein said nucleic acid encodes a polypeptide 
capable of binding to a ligand for an insect steroid 
receptor superfamily member. 

4 . An isolated recombinant nucleic acid of 
Claim 1, wherein said nucleic acid is capable of 
hybridizing to an insect steroid receptor superfamily 
member gene segment under selective hybridization 
conditions . 

5. An isolated recombinant nucleic acid of 
Claim 4, wherein said selective hybridization conditions 
are stringent hybridization conditions. 

6. A cell transformed with an isolated recombinant 
nucleic acid of Claim 1. 

7. An isolated recombinant nucleic acid having a 
sequence exhibiting identify over 20 nucleotides of a 
coding segment of an insect steroid receptor superfamily 
member having steroid binding domain homology. 

8. An isolated recombinant nucleic acid of 
Claim 7, wherein said nucleic acid encodes a polypeptide 
which binds to a control element responsive to a ligand 
of an insect steroid receptor superfamily. 

9. A cell transformed with an isolated recombinant 
nucleic acid of Claim 7. 
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10. An isolated recombinant nucleic acid comprising 
a DNA sequence capable of binding to an insect steroid 
receptor super family member other than 20-OH ecdysone 
receptor. 

11. An isolated recombinant nucleic acid of 
Claim 10, wherein said insect steroid receptor 
superfamily member is DHR3, E75A or E75B. 

12. An isolated recombinant nucleic acid of 
Claim 10, wherein said DNA sequence promotes 
transcription of an operably linked sequence in response 
to binding by said insect steroid receptor superfamily 
member. 

13. An isolated recombinant nucleic acid of 

Claim 10, wherein said DNA sequence is operably linked to 
a DNA sequence encoding a polypeptide. 

14. An isolated recombinant nucleic acid of 
Claim 10, wherein said isolated recombinant nucleic acid 
is an expression vector. 

15. A cell transformed with an isolated recombinant 
nucleic acid of Claim 10. 

16. A cell of Claim 15, wherein said cell also 
comprises said insect steroid receptor superfamily 
member . 

17. A recombinant nucleic acid comprising: 

a control element responsive to a ligand of an 
insect steroid receptor superfamily member 
ligand responsive control element; 

a non-heat shock promoter sequence; and 

a sequence comprising a reporter gene. 

18. A recombinant nucleic acid of Claim 17, wherein 
said non-heat shock promoter sequence is an alcohol 
dehydrogenase promoter. 

19. a cell transformed with a recombinant nucleic 
acid of Claim 17. 

20. A cell of Claim 19, wherein said cell is a 
mammalian cell. 
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21. A recombinant nucleic acid of Claim 17, 
comprising additional copies of control elements 
responsive to a ligand of an insect steroid receptor 
super family member. 

22. A method for monitoring expression of a 
reporter gene comprising the steps of: expressing a 
recombinant nucleic acid of Claim 17, wherein no 
radioactive reagent is used in said monitoring of said 
reporter gene. 

23. A polypeptide comprising an insect steroid 
receptor superfamily member or fragment thereof, wherein 
said polypeptide is substantially free of naturally- 
associated insect cell components and exhibits a 
biological activity characteristic of an insect steroid 
receptor superfamily member with a steroid binding 
domain. 

24. A polypeptide of Claim 23, wherein said insect 
steroid receptor superfamily member is selected from the 
group consisting of EcR, DHR3, E75A and E75B. 

25. A polypeptide of Claim 23, wherein said insect 
steroid receptor superfamily member or fragment thereof 
also comprises a DNA binding domain. 

26. A polypeptide of Claim 23, wherein said insect 
is Drosophila melanogaster. 

27. A polypeptide of Claim 23, which is capable of 
binding to a hormone analogue selected from the group 
consisting of an insect hormone and an insect hormone 
agonist. 

28. A polypeptide of Claim 27, wherein said insect 
hormone is an ecdysteroid. 

29. A polypeptide of Claim 27, wherein said insect 
hormone is 2 0-OH ecdysone. 

30. A polypeptide of Claim 23, which is capable of 
binding to a DNA control element responsive to an insect 
hormone . 

31. A polypeptide of Claim 30, wherein said 
polypeptide comprises a zinc-finger domain. 
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32. A polypeptide of Claim 30, wherein said insect 

hormone responsive DNA control element is operably linked 
to a transcription unit which is responsive to said 
binding . 

5 33. A polypeptide of Claim 32, wherein said insect 

hormone responsive DNA control element is upstream from 
said transcription unit. 

34. A polypeptide of Claim 23 fused to a second 

polypeptide . 

10 35. A polypeptide of Claim 34, wherein said second 

polypeptide is a heterologous polypeptide. 
36. A polypeptide of Claim 35, wherein said 

heterologous polypeptide comprises a second steroid 
receptor superfamily member. 

15 37. A polypeptide of Claim 23, wherein said 

fragment has a sequence substantially homologous to a 
consensus El region sequence. 

38. A polypeptide of Claim 23, wherein said 
fragment has a sequence substantially homologous to a 

20 consensus E2 region sequence. 

39. A polypeptide of Claim 23, wherein said 
fragment has a sequence substantially homologous to a 
consensus E3 region sequence. 

40. A polypeptide of Claim 23, wherein said 
25 fragment has a sequence comprising: 

a segment at least about 25% homologous to a 

consensus El region sequence; 
a segment at least about 3 0% homologous to a 

consensus E2 region sequence; and 
30 a segment at least about 30% homologous to a 

consensus E3 region sequence. 

41. A composition of matter comprising a 
polypeptide of Claim 23. 

42. A cell comprising a polypeptide of Claim 23. 
35 43. A cell of Claim 42, wherein said cell is a 

human cell. 
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44. An antibody or binding fragment thereof 
exhibiting binding specificity to a polypeptide of 
Claim 23, wherein said binding specificity is directed to 
an epitope characteristic of an insect steroid receptor 
superfamily member. 

45. A method for selecting DNA sequences capable of 
being specifically bound by an insect steroid receptor 
superfamily member, said method comprising the steps of: 

screening DNA sequences for binding to a polypeptide 

of Claim 23; and 
selecting said DNA sequences exhibiting said 

binding. 

46. A method of Claim 45, wherein said DNA sequence 
is operably linked to a gene selected from the group 
consisting of EcR, DHR3, E74 and E75 genes. 

47. A method for selecting ligands specific for 
binding to a hormone binding domain of an insect steroid 
receptor superfamily member, said method comprising the 
steps of: 

screening compounds for binding to one or more 

superfamily members; and 
selecting compounds exhibiting specific binding to 

the members. 

48. A method of Claim 47, wherein said ligand is an 
ecdy steroid. 

49. A method of Claim 47, wherein said ligand is a 
20 -OH ecdysone antagonist. 

50. A method for modulating insect physiology or 
development comprising the steps of: 

screening compounds for binding to an insect steroid 

receptor superfamily member; 
selecting said compounds exhibiting said binding; 

and 

administering to an insect said ligand. 

51. A method of Claim 50, wherein said modulating 
is lethal to said insect. 
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52. A fusion polypeptide comprising a hormone 
binding domain of an insect steroid receptor superfamily 
member and a second polypeptide. 

53. A fusion polypeptide of Claim 52, wherein said 
second polypeptide comprises a DNA binding domain from a 
second steroid receptor superfamily receptor member. 

54. A nucleic acid encoding a fusion polypeptide of 
Claim 52. 

55. A method for selecting ligands specific for 
binding to a ligand binding domain of an insect steroid 
receptor superfamily member, said method comprising the 
steps of: 

combining: 

(i) a fusion polypeptide of Claim 52, wherein 

said fusion polypeptide comprises 
said ligand binding domain 
functionally linked to a DNA binding 
domain of a second steroid receptor 
superfamily member; and 

(ii) a second nucleic acid sequence encoding a 

second polypeptide, wherein 
expression of said second nucleic 
acid sequence is responsive to 
binding by said DNA binding domain; 
and 

screening compounds for an activity of inducing 
expression of said second polypeptide; and 
selecting said compounds. 

56. A method of Claim 55, wherein said combining 
occurs within a cell. 

57. A method of Claim 56, wherein said combining 
step results from expression upon transformation of said 
cell with a nucleic acid encoding said fusion 
polypeptide. 

58. A method for modulating insect physiology 
comprising the step of: 
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administering to an insect a ligand selected by a 
method of Claim 55. 

59. A method of Claim 55, wherein said binding 
domain is selected from binding domains of insect steroid 

5 receptor superfamily members selected from the group 

consisting of EcR, DHR3, E75A and E75B. 

60. A method of Claim 58, wherein said ligand is 
lethal to said insect. 

61. An isolated receptor control element non- 
10 responsive to 20-OH ecdysone comprising a DNA segment 

capable of binding to an insect steroid receptor 
superfamily member and capable of controlling expression 
of an operably linked gene. 

62. An isolated receptor control element of 

15 Claim 61, wherein said operably linked gene is within 

about 50 kb of said control element. 

63. An isolated receptor control element of 
Claim 61, wherein said insect steroid receptor 
superfamily member is DHR3, E75A or E75B. 

20 64. A method for producing a polypeptide comprising 

the steps of: 

selecting a cell which is substantially insensitive 
to exposure to an insect steroid receptor 
superfamily ligand; 
25 introducing into said cell: 

(i) a receptor for said ligand; and 

(ii) a nucleic acid sequence encoding said 

polypeptide, said nucleic acid 
sequence operably linked to a 
30 control element responsive to 

presence of said selected 
ligand, wherein a transformed 
cell is produced; and 
exposing said transformed cell to said ligand. 
35 65. A method of Claim 64, wherein said cell is a 

mammalian cell. 
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66. A method of Claim 64, further comprising the 
step of introducing said cell into an intact organism. 

67. A method of Claim 64, wherein said cell is a 
plant cell. 

68. A method of Claim 64, wherein said ligand is 
20 -OH ecdysone. 
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